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Preface 


A striking phenomenon in the recent evolution of the sciences is that there 
is scarcely a field of application in which the theory of probability does 
not play a role. In economics, finance, traffic and transport, signal and 
image processing, genetics, environmental sciences, as well as for mod- 
eling structures exposed to wind, waves, or earthquakes, and for the sci- 
ence of materials—the standard models make use of random variables and 
stochastic processes, and sometimes the finer points of stochastic analysis. 
Curiously, under the pretext that there is randomness in the model, engi- 
neers sometimes allow crude treatments that they would not accept in the 
analysis of deterministic models, as if random phenomena need not be 
treated with rigor. For instance, calculations with second-order moments 
and gaussian hypotheses are confused, extreme-value laws are mistak- 
enly applied to nonadditive quantities, although the asymptotic estimates 
‘are not compatible with nonlinear changes of scale, the method of statis- 
tical linearization is used without the necessary precautions, asymptotic 
laws are used instead of the actual laws without estimating the differ- 
ence between them, and so on. These misuses are encouraged by the cur- 
rent fashion for anything heuristic (because it resembles informal scientific 
investigation). The serious fault in these practices is that the results they 
give can neither usefully support a decision nor contribute to a discussion 
of the comparative risks of different choices, because there is no informa- 
tion about their accuracy. . 
The main motivation for this book is to give greater rigor to numerical 
treatments of stochastic models. This approach turns out to be scientifi- 
cally fertile, since it extends the domain of application of mathematics 
and allows proofs that are an important preliminary to implementation on 
computers. ude 
The quantities that we calculate in stochastic models are ue 
expectations (since probabilities of events, variances, and so 
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also expectations). If we forbid the crude methods outlined above wh 

methods are valid? First of all, in many cases we can compute expectation, | 

by analytic methods. It follows from the mathematical properties of i | 
model that the desired expectations must satisfy certain deterministic equa- 
tions, which we can solve numerically. Chapter 4, on Markovian problems 
illustrates this situation. In other cases, we can use simulation methods, i 
which we pick points from a space equipped with a probability measure 
and take the average value over these points of the random variable whose 
expectation we want to calculate. These methods can be applied to a great 
variety of situations, and for nearly all the mathematically well-defined 
random variables (however, see the examples of non-Riemann integrable 
variables in Chapter 3, Section A and Chapter 5, Section C). 

Simulation methods split into two families: those that use an imitation 
of randomness, the Monte Carlo method and the shift method (one based 
on the law of large numbers, the other on the pointwise ergodic theorem), 
are only absolutely rigorous if we suppose that the imitation of random- 
ness is perfectly realized, which allows us to establish results by prob- 
abilistic reasoning. The other family contains the so-called quasi-Monte 
Carlo methods, in which we pick sequences of points, not randomly, but 
so that they have an asymptotic property of distribution that allows us to 
use them to calculate an expectation by averaging (provided we go about it 
correctly). These methods are rigorous and give demonstrable results, but 
they are less widely applicable than the Monte Carlo and shift methods. 
In particular, we do not know how to apply them effectively in infinite 
dimensions. 

This book is based on a postgraduate course given by the authors 
at Paris 6 University. Like any work of applicable mathematics, it lies 
between the extreme points formed by a theoretical academic treatise, an 
elementary pedagogical account, and a specialized technical manual for 
engineers. Taking into account that computer systems that use both num- 
bers and symbols and implement simulation methods will soon appear, it is 
important to decide precisely what knowledge in this field should be kept 
on paper. This has led us to accent the sorts of problems raised by new 
methods and to point out openings for research. The book nevertheless 
remains introductory and presents the current state of the main methods 
and ideas, the cases for which they have been proved, and the boundary 
areas where they work less well and need improvement. 

We begin with a preliminary chapter in which we recall the main ideas 
from analysis and probability theory that are required in the sequel. Most 
of the notation is defined here. In principal, this chapter should not be read 
exhaustively; the reader can refer to it periodically in the course of later 
chapters. 


| 
| 
| 
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In Chapter 2, we explain the mathematical framework in which the 
ideas of simulation, both for Monte Carlo and quasi-Monte Carlo methods, 
can be correctly expressed. The fundamental notions here are those of 
narrow convergence and of a p-Riemann integrable function. We also 
define the discrepancy of a sequence and give equivalent conditions for 
uniform distribution. These fundamental ideas are applied in Section B to 
the Monte Carlo method. We have stated the results that are simplest and 
most useful for applications, and points of entry into the vast literature of 
deep results are given in the notes. We emphasize the importance of stop- 
ping conditions and asymptotic speeds and devote a section to showing, by 
explicit examples, thé crucial importance of stopping conditions in algo- 

rithms for stochastic approximation. Section C is devoted to quasi-Monte 
Carlo methods. We give the Koksma—Hlawka inequality in a simplified 
form and investigate the main families of low-discrepancy sequences. We 
also explain the use of low-discrepancy sequences in simulation (which, 
given care, is straightforward) and give a detailed discussion of the differ- 
ence in nature from the Monte Carlo method. We continue by suggesting 
avenues for research involving mixed methods lying between the Monte 
Carlo method and quasi-Monte Carlo methods. The chapter ends with a 
section on calculating conditional expectations where Monte Carlo and 
quasi-Monte Carlo methods are used for polynomial approximation and 
discretization. It is very surprising that this has not previously been cov- 
ered in the literature, other than in specialized works on numerical fil- 
tering. 

Chapter 3 deals with the simulation of the main families of random pro- 
cesses and fields. Since the canonical probability space on which we define 
a random process is most often infinite-dimensional, we must adopt a more 
precise approach than in Chapter 2; this will also be useful in Chapter 5 for 
the study of Wiener space. Despite appearances, the infinite-dimensional 
case is very frequent (most often, this means only that the dimension is 
not bounded, as when calculating the expected stopping time of a Markov 
chain), it is the appropriate domain of application for the shift method. 
Although based on classical theory (Birkhoff’s pointwise ergodic theorem 
of 1931), it seems that the considerable advantages of this method, as far 

as computers are concerned, have only been popularized since the 1988 
course at Paris 6 University, which developed into this book. The method 
and the principles for its implementation are accompanied here by results 
on the speed of convergence. There are many open problems, such as that 
of finding quick ways to estimate speed of convergence, particularly when 
the method is used in finite dimensions, where it can also be very useful. 
The section ends with an account of research on generic points for the 
shift. The rest of Chapter 3 is devoted to stationary processes and fields. 
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Markov processes 


objects of 
We give 


icone a pa point processes. Stationary processes, alth 
ch than in the past, remain a very useful tool for 
an account of them that does 
and present the simpler ideas concernin 
pa processes in the next chapter; h 
cian pan for applications in Paul Lévy’s idea of recursive sim- 
a AA a of the simulation of Brownian motion before it 
sane ou 2 a we give an introduction to current research on condi- 
F : rocesses with independent increments are valuable tools 
or modeling that generalize models using Brownian motion; we take th 
Opportunity to consider at the same time subordination in the sense 
Bochner, which is rich in applications for simulation, but little known to 
engineers. The chapter ends with an introduction to point processes. These 


are marvelous tools for constructing models in very diverse fields of appli- 
cation. 


Generally, Markovian models lend themselves nicely to analytic cal- 
culations where simulation methods should be used only as a last resort, 
Since we have given references to the classical works on Markov pro- 
cesses, we decided not to derive the more classical formulas, but rather to 
lay out the more useful tools for calculation without too much explana- 
tion. The sweeping algorithm, which is of considerable theoretical interest, 
is also an effective numerical tool, as we illustrate with an example. Its 
principle could be extended beyond the linear framework. The reduction 
algorithm is connected to a number of problems of optimal control and 
has recently found new applications for calculations involving “American” 
options in the mathematics of finance. It seemed to us to be of interest to 
introduce the carré du champ operator, which has a major theoretical role 
and yields useful formulas in certain applications, notably in finance. 

The fifth and final chapter approaches questions of a slightly higher 
mathematical level. It shows that, thanks mainly to stochastic differen- 
tial equations, the objects of stochastic calculus (i.e., defined in terms of 
Ito’s stochastic integral in relation to Brownian motion) are both at the 
root of many models of physicists and engineers, and at the same time 
particularly difficult to simulate, because they are profoundly irregular. | 
The treatment of stochastic differential equations and their discretization 
are standard; many improvements in this area can be found in the recent 
literature. However, it seemed important to give a detailed explanation of 
Wiener functionals, a field in which active theoretical research meets prob- 
lems of application. We also introduce a tentative axiomatization of r 
notion of a simulatable random variable and its main properties. eae 
symbolic computation at present deals only partially with stochastic ca 
culus, it seemed important for the future to give certain of the main form 


ough less 
: engineers, 
not require previous knowledge 


g their simulation. We return to 
ere we use little more than their 
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representations attached to stochastic integrals and stochastic diff i 
equations (the formulas of Hu—Meyer and Isobe-Sato). In the nee sce 
we show how the shift method can be applied to Wiener space pr me 
the paths (which does not work), but by using representations as a a 
of independent variables, that is, on product spaces. it 
We must emphasize that there are questions in probability theo 
related to numerical methods, that we do not address, or do not see 
fully, in this book. Notable among those covered further in the literature 


are: 


e classical methods in simulation (algorithms, reduction of variance, 


etc.) 
e convergence of empirical processes 
e stochastic approximation and stochastic control 


e computation of expectation with respect to an invariant distribution 


of a recurrent Markov chain 
e probabilistic methods for solving equations in physics 
e fine methods for discretizing stochastic differential equations 


e numerical aspects for nonlinear filtering 


But this book is introductory, and its main point is the numerical use of 
the shift method, which will without doubt be of prolonged use in the areas 
mentioned above. We thank the researchers of the Centre d’Enseignement 


et de Recherche en Mathématiques Appliquées de l’Ecole des Ponts et 
Chaussées at Paris who have practiced and perfected the methods given 
in the book, as well as the works on which we have relied. We would 


also like to thank Alexander Scott and Philipp Regalia for their help in 


preparing the English version. 


Nicolas Bouleau 


Dominique Lépingle 


Paris, France 
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CHAPTER 1 


Preliminaries 


The theory and applications of stochastic processes are constantly growing 
and it is becoming ever more difficult to write a book on this topic which is 
simultaneously advanced, self-contained, and of reasonable length. Since 
we are concerned only with the simulation of such processes, we have 
attempted to strike a balance between what deserves careful exposition 
and what we assume to be more or less familiar to the reader. 

For those who are not acquainted with what we have left out, we have 
gathered in this chapter the definitions and results that will be used in the 
sequel. We do not include proofs and we refer for complete exposition 
to the excellent recent books by Ikeda and Watanabe [1981], Dacunha- 
Castelle and Duflo [1982, 1983], Durrett [1984], Rogers and Williams 
[1987], and Revuz and Yor [1991]. In other words this chapter is best 
used for reference: take a quick glance through it and carry on, returning 


| whenever you meet some notation, definition, or theorem which is handled 


without particular explanation. 


A SET THEORY AND GENERAL TOPOLOGY 


A.1 Some Notations 

A.1.1 oe ee 
The set of real numbers is denoted by R, and we write R, = [0, cel, R, 
J0, œ[. For x,y in R, we set 


xA y= min (x,y) 
x V y = max (x, y) 
xt = X V 0 


x = (—x) VA) 


PRELIMIN Abe 
la) a eg | 
[x] = in tegral part of x (largest integer not exceeding x) 

{x} = fractional Part of x (x — [x]). 


Similarly, for x = scsi) and y= (y1,...,y4) in R, we m 


[x] = (xl, ...,[xal) 

{x} =({x1},..., {xg}) 
ARYE A Vi scan SLEW) | 
(x,y) = MY FES FX avy | | 

lx] = (x, x)". 


If xa y =x, we define 


1x, y]] = Pi, yı] X-X payal 
[Ex, y]] = ey, 11 x S [xa, yal. 


The notation ]]x, y]] will also be used when some of the x;’s are —co, We 
denote by x’ the transpose of the vector x and identify xy’ with (x,y), | 


A.1.2. 


A real number r is called an algebraic number if it is a solution to some 
polynomial equation 


yo +a\X+-->+ay,x"=0 | 
| 


with integral coefficients (a;; i = 0.24578). 


A.1.3 
À Gaussian integer is a complex number a+ ib, where a and b belong to 
Z. The ring of Gaussian integers is denoted by Z[i]. 


A.1.4 

If B is a subset of A, the difference of A and B is denoted by A\B, or more 
simply B° if there is no doubt about A. If A and B are disjoint sets, a 
will sometimes write A + B instead of A U B. For instance, the symmetric 
difference of the sets A and B is 


AAB = (AN B°) + (BN A‘). 


We 


me 


Q 
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To every subset B of A we assign a function 1p, to be called the indicator 
of B, defined by 


_fl1l ifxeB 
180) = 9 ifxé B. 


A.2 A Summary of General Topology 


A.2.1 
Let (E,.7 ) be a topological space. If A is a subset of E, we write A for 
the interior of A,A for the closure of A, and 0A for the boundary of A. 


A.2.2 

Given the topological space (E,.7 ), it is natural to ask whether or not 
there exists a metric d on E that induces the topology .X The topological 
space (E,.7 ) is said to be metrizable if such a metric exists. A metric 
space (E,d) is complete if every Cauchy sequence in E converges to a 
point in E. A topological space (E,.7 ) is a Polish space if it is separable 
and if there exists a metric d that induces te topology and makes E a 
complete metric space. 


A.2.3 

A normed vector space is a Banach space if the induced metric makes it 
a complete metric space. A useful theorem, for both finite- and infinite- 
dimensional real normed vector spaces, is the Hahn—Banach theorem: Let 
E be a vector space and F be a subspace of E. Suppose A: F — R is linear 
and there exists c > 0 such that |A(x)| < c|x| for all x in F. Then there 
exists an extension of À to a linear mapping from E to R with the same 


bound c. 


A.2.4 

Let E be a real or complex vector space and let (x, y) ~> (x, y} be a sym- 
metric or Hermitian form on E. This form is positive definite if (x,x) > 0 
for all x # 0. A real or complex Hilbert space is a vector space with a 
positive definite symmetric or Hermitian form, which is complete under 


the corresponding norm 
lx] = (x, x). 


Let (v;,i € 1) be a family of elements of a Hilbert space H such that v; # 0 
for all i. We can take the subspace of all linear combinations 
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CiVi °°° + Cri, 


for finite subfamilies (vj,;j = 1,...,n) of (vi) and real or complex coeff 
cients (c;). We say that the family. (v;) is total in H if the closure of this 
subspace is equal to H. i 


A.2.5 


Let C be a nonempty closed convex set in a Hilbert space H. For every 
x € H, there exists a unique p(x) € C such that 


|x — p(x)| = inf{|x — yl: y € C}. 


We call p(x) the orthogonal projection of x onto C. In particular, every 
non-empty closed convex set in a Hilbert space contains a unique element 
of smallest norm. 


A.2.6 

A topological space (E, 7 ) is a Hausdorff space if for every pair of 
distinct points a,b in E there are disjoint open sets U and V such that a 
is in U and b is in V. We say that E is locally compact if every point 
in E has a compact neighborhood. A subset of E is said to be relatively 
compact if its closure is compact. 


A.2.7 


For se N° the Euclidean space R° is a real Hilbert space with the usual 
symmetric form (x,y). The induced topology makes it a locally compact 
Hausdorff space. The quotient topology on the torus Ts; = R°/Z is the 
largest topology such that the canonical projection Ts: Rë > R°/Z is con- 
tinuous. This topology makes T; a metrizable compact Hausdorff space. 


A.3 Functional Analysis 


A.3.1 

A real or complex function f on a locally compact Hausdorff space E is 
said to vanish at infinity if, for every € > O there exists a compact set 
K CE such that | f(x)| < € for every x not in K. We denote by @o(E) the 
class of all continuous functions on E that vanish at infinity. 


A.3.2 ; 
Let (E, 7 ) be a topological space. We denote by @ (E) (resp., FER) 
CE), C (E; R%)) the set of real-valued continuous functions on E (resp. 


ee 
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R4-valued continuous functions, bounded continuous real-valued func- 
tions, bounded continuous R<-valued functions). 


A.3.3 

An algebra A of real-valued functions on E is a ring of functions such 
that if f,g € A then f +g € A,fg € A, and cf € A for every c € 
R. If E is compact, then every continuous function on E is bounded and 
we have the following Stone-Weierstrass theorem: If A is a subalgebra of 
Z(E) which separates points and contains the constant functions, then the 
uniform closure of A is equal to @ (E). There is also a complex version of 
the Stone- Weierstrass theorem for self-conjugate subalgebras (1.e. if f € A, 
then f € A, where f is defined by f(x) = f(x)). If E is a compact interval 
[a,b], we conclude that every continuous real function on [a,b] can be 
uniformly approximated by polynomials on [a,b] : this is the so-called 
Weierstrass theorem. 


A.3.4 

It is sometimes useful to have a criterion for when a subset of a function 
space is relatively compact. This is the purpose of Ascoli’s theorem: Let 
(E,d) be a compact metric space, let F be a Banach space, and let ® be a 
subset of the space % (E; F) of continuous maps from E into F. Suppose 
that for every € > 0 there exists 6 > 0 such that whenever x,y € E and 
d(x,y) < 6, then | f(x) — f(y)| < € for all f e ®. Suppose also that for 
every x € E, the set 


(x) = {Œf € D) 


is relatively compact. Then ® is relatively compact in Z(E; F) endowed 
with the sup norm. 


B PROBABILITY THEORY 


B.1 Measure Spaces 


B.1.1 

If..@ is a o-field (o-algebra) in a set Q, then (Q, Æ) is called a measur- 
able space and the members of. A, are called the measurable sets. We shall 
most often deal with topological spaces (E, Z ) and there is a smallest ©- 
field.Z in E such that every open set in E belongs to PB. The members 
of .Z are called the Borel sets of E and PB. is called the Borel o-field in 
E. We often denote this o-field by Z(E) or €. A mapping f : E >R or 
C such that f~!(V) is a Borel set for every open set V in R or C will be 
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called a Borel function. The set of bounded Borel functions on E will b 
denoted by Dé. i 


B.1.2 

Let (Q, Æ) and (Q, £’) be two measurable spaces. A function f : Q 
Q’ is called measurable if f) c Æ. If F= (fi, i € I) is a set of 
functions from Q into measurable spaces ((Q;,-4;),i € I), then (F) 
is the smallest o-field in Q which makes every f; a measurable function 
In the same way, if @ is a collection of subsets of Q, then o6(@) is the 
o-field generated by @, that is, the smallest o-field containing ©. 


B.1.3 

A collection Z of subsets of Q is an algebra if it contains Q and is closed 
under the set operations (1,°). A collection %4 of subsets of Q is a mono- 
tone class if 


{An} c X andA; cAc...imply U An € LC 


and 


{An} C @6 andA: > À, D ...imply N An € V. 


If ¥ is an algebra and %4 is a monotone class with € c @é, then 
o(@ ) c @. This is the monotone class theorem for sets. There is also a 
monotone class theorem for functions: Let HW be a vector space of func- 
tions from Q to R such that 


fi) le Æ, 

(ii) if (hann e N) isa non-decreasing sequence of non negative ele- 
ments of JZZ with lim, hn = h, where h is finite (resp., bounded), 
then hE HE. 


Suppose that .7 contains the indicator function 14 of every set A in some 
collection Z of subsets of Q, which is closed under finite intersections. 
Then .% contains every real-valued (resp., bounded real-valued) function 
on Q which is o(@ )-measurable. 


B.1.4 

A (positive) measure is a function u from a o-field . € to [0,9], 
is countably additive and has (Ø ) = 0. The space (Q,.4@,p) is then 
called a measure space. A measure p on (Q. Z) is said to be finite if 


co| ; which 
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ui = HD) < °° and ©-finite if Q is a countable union of measurable A; 
with u(A;) < ce. 


B.1.5 
An important role is played by the Caratheodory extension theorem: Let 


be an algebra in Q and let up: & — [0, œ] be countably additive and 
o-finite with u(@ ) = 0. Then u can be uniquely extended to a measure 


on HE). 


B.1.6 
Given a measure space (Q,./-6,11), we may associate with any A in .@ 


another measure space (A, N A, ula), where 


UaBNA=UBNA), Be-4; 


ul, is called the restriction of u to A. 


B.1.7 
Let f : (Q,.4) — (Q, 6’) be a measurable function. If u is a measure 


on (Q, Æ), then v = of ' is a measure on (Q, Æ’), which we denote 


B.1.8 
Suppose that, for every i in some (possibly uncountable) index set 


1, (QÏ, 4) is a measurable space. Let Q denote the Cartesian product 


OMe | | a 


iel 


The product ©-field 4 T) ig defined to be the o-field o(m',i € I), where 
ni : QU) — Q! are the canonical projections. If J = {1,2}, the product 


o-field is denoted by. 4!®@.@’. 
If u; and u are o-finite measures on (Q!,.!) and (Q2, 47), respec- 
tively, then there exists a unique O-finite measure on (Q'xQ?,.46!@.6’) 


determined by the relation 
U(A; X Az) = Hi(Aj)M2(A2), AE." 


This is called the product measure pı ® p2. 
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B.1.9 


The Lebesgue measure in [0, 1], [0, 1[, R} or R is denoted by dx or À I 
the proue spaces [0, 1], Rf, or T, = R°/Z", for s e N*, we shall de 
À; or (À1}° or dx or (dx). In [0,1] = [0,1]”, the infinite-dimensionay 
Lebesgue measure will be denoted by Vw. 

B.1.10 


Let x € Q and suppose that the subset {x} belongs to a o-field £ on Q 
The Dirac measure 8, on (Q,.%4) is defined by ) 


1 ifxeA 
6,(A) = 
A) O if x ¢ A. 


B.1.11 


Let (Q, Æ , u) be a measure space. Two measurable real or complex func- 
tions f and g on (Q, Æ) are said to be equivalent if f(x) = g(x) for u- 
almost every x. If 1 < p < œ, L?(Q,.4@, wu) is the space of equivalence 
classes of functions which satisfy 


| 1/p 
Ifl = | f ror 10) de 


If p = œ, we set 


IL = inf {K : yd : |f@)| > KP = 0}. 


The LP-spaces (1 < p < ©) are Banach spaces and the L?-spaces are 
Hilbert spaces with the Hermitian form 


oe | FOIE dun). 


| i ; | the 
If Q is R or any subinterval of R and p is Lebesgue measure, then a 
(equivalent classes of) continuous functions on Q with eae st 
are dense in every L’,1 < p < œ. This is also true if Q = R° and H = As 
for s e N°. 
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B.1.12 

If (E,.7 ) is a topological space with Borel o-field &, then a measure 
u on (E, € ) is called a Borel measure. Let (E, €) be a locally compact 
Hausdorff space and denote by (E) the vector space of real-valued con- 
tinuous functions on E with compact support (i.e. vanishing outside a com- 
pact set). Let A: Z(E) > R bea positive linear map (Af > 0 whenever 
f 2 0). Then (Riesz representation theorem) there exists a unique Borel 
measure on (E, Æ) satisfying conditions (i) and (ii) below: 


(i) If V is open, 
u(V) = sup{Ag;g € @(E),0< g< 1, supp(g) € V}; 
(ii) If A is a Borel set, 


UA) = inf {u(V); V open, A c V}. 


Such a Borel measure is called a Radon measure and, for every f in #%(E), 
we have 


j f= E du. 


Now let À : Z(E) — R be a linear map which is also continuous for the 
sup norm. It can be written as the difference of two continuous positive 
linear maps, so there exist two finite Radon measures u4, u2 such that 


Af = fs du - fs du. 


The countably additive mapping y : € — R defined by u = Hı — W2 
depends only on A and is called the bounded measure associated with A. 


B.1.13 
A function f on IR” is nonnegative definite if for any finite sets {x;,i € 1} 
in R and {o;i € I} in C, 


2TG — xj)O;O; 2 0. 


ijel 
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If, moreover, f is a continuous function, then (Bochner's theorem) ther 
exists a finite measure u on R@ such that és 


f(x) feum, re R’, 


In the discrete case, we have the Herglotz lemma: Suppose f : Z -> C 
satisfies i 


>. f(n- mnam 2 0 


nmel 


for any finite subset / of Z and any set (a, € 1} in C; then there exists 
a finite measure u on | - n, +n] such that 


fin) fnan, ne Z. 


B.2 Schwartz Distributions 


B.2.1 
Let D; = 0/0x; be the ith partial derivative applied to functions on R*. For 
a p-triple (p1,...,p,) = p of integers > 0, we let 


D? =DẸ' -D and — |p| = pi +---+ps. 


The space Z (or € *(R°*)) of all infinitely differentiable real-valued func- 
tions on R° with compact support is called the space of test functions. A 
linear map T : Z = R is a distribution if for every compact set K in R’, 
there exists a constant Ax and an integer m such that 


(Tel < Ax sup sup |D°@(x)| 


|pi<mxeK 
for every @ in Z that vanishes outside K. 


B.2.2 
If f is a Borel function on R*, which is À.-integrable on every compact 


subset of R‘(f e Z! (R°)), then the map 


loc 
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TF(@) = [roar., Qe Z 


is a distribution. If u is a Radon measure or a bounded measure on RS 
then the map 


TE) = foan pe Z 


is a distribution. If T is a distribution, we define a new distribution DIT, 
fori=1,...,5, by putting 


(DiT)(@) = —TWi), pe J. 


This notation is consistent with the ordinary derivation of functions since, 
if f is a &!-function, we have 


DiT¢ = Thp;f. 


We may also define DPT for any differential operator DP in a natural way. 


B.3 Random Variables 

B.3.1 

We shall mainly be interested in finite measures with total mass equal 
to 1. Such a measure is called a probability measure and the set of all 
probability measures on (Q, Æ) is denoted by .#,(Q,.<#). If P € 
M6'(Q, A), the triple (Q, Æ, P) is called a probability space. 


B.3.2 
A random element is a measurable function X from a probability space 


(Q,.%4, P) into a measurable space (Y, 4’). If (Q’,.6’) = (R, A(R) 
(resp., (R4,.Z(R%))), it is called a random variable (resp., a d-dimen- 
sional random variable). The distribution, or law, of the random element 
X is the probability measure P oX -1 = XP. The distribution function Fx 


of a random variable X is defined by 
Fy(x) = P(X e] - 0, xl), xe R. 


i i ‘ ` , ` 1» d á 1- 
We shall also use the distribution function of an R“ -valued random vari 
able: 
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Fx(x)= PX e]]- ~,x]]), xe Rt. 


A density function of a d-dimensional random variable (or of its distriby 
tion) is any function f such that, for every Borel set B in Rf, | 


P(X € B) = | fa) dx. 
B 
B.3.3 


Let X be a P-integrable d-dimensional random variable. The expectation 
or mean value of X is defined by 


EX = [x dP. 


This depends only on the equivalence class of X in L'(Q,.4, P; RÌ). If x 


and Y are two (equivalence classes of) random variables in L7(Q,.4, P), 
their covariance is 


Cov (X, Y) = E(XY) - EXEY: 


A d-dimensional random variable X is centered (has zero mean) if its com- 
ponents are P-integrable and EX = 0. 


B.3.4 


If X is a d-dimensional random variable with distribution uy, the char- 


acteristic function of X (or of uy) is the function x : R? — C defined 
by 


Ox(u) = Ee?) = [exe {i(u, x)}Ux(dx). 


The function px uniquely determines Lx. 


B.3.5 


A real Gaussian random variable X is a random variable with distribution 
Nm, v) determined by the characteristic function 


Ex(u) = exp {imu — vu}, ue R, 
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where m € R, v e R,. A d-dimensional Gaussian random variable is 
a d-dimensional random variable such that any linear combination of its 
components is a real Gaussian random variable. Its distribution is denoted 
by Na(m, 2), where m is the mean vector and £ the covariance matrix of 
the components. : 


B.3.6 
A collection of 6-fields (.F;,i € T) contained in .# is said to be indepen- 
dent if, for any finite subset J of Z and any Aje F;,j € J), 


r( a si) = | | Pap. 


Jed jeJ 


Random elements (X;,i € 1) on (Q, Æ, P) are said to be independent if 
the o-fields (G(X;), i € I) are independent. 


B.3.7 

A sequence of d-dimensional random variables (X;,i € I c N) is said to 
be 1.1.d. (independent, identically distributed) with common law u if the X; 
are independent and have the same distribution u. Such sequences satisfy 
the strong law of large numbers: if f |x|u(dx) < ©, then 


M dx = [suas P-as. 
i=1 


B.3.8 
A sequence of d-dimensional random variables (Y,,n 2 1) converges in 


distribution or in law, to the Rd-valued random variable Y (resp., to the 
probability measure u on R) if for every f in PRI), 


BIY) > ELY)] 


(resp., E[f(Y,)] — [f œ)u(dx)). | 

We shall write Y, -3 Y (resp, Yn > y). Notice that each Y, may be 
carried by its own probability space. A basic fact in probability theory 
is the central limit theorem: Let (Xn,n 2 1) be an iid. sequence of 
d-dimensional random variables with common law | such that 


p” 
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J Puan < 00, 


Set 


m= [nao 


pe | (x — m)'(x — md) 
gao Yxi n21. 
i=1 


Then nl 2(§,, — nm) converges in distribution to N4(0, D). 


B.3.9 

Let .Z be a sub-o-field of £. The conditional expectation given 3 of 
a random variable X in L1(Q,.4, P) is the random variable (up to a P- 
equivalence class) E(X|.2 ) defined by the two following properties: | 


(i) E(X|.Z ) is A-measurable, 
(ii) for every BE P, 


BdP. | 


f xav=] E(X 
B B 


For A e Æ, we define the conditional probability of A given A by the 
equation 


P(A|. 4%) = El] P) P-as. 


C RANDOM PROCESSES 


C.1 General Random Processes 


C.1.1 

We shall most often be interested in infinite-dimensional situations, where 
there is an infinite collection of random variables defined on the same 
probability space. This collection will be called a random field or random 
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process. The index set of these collections will be ei Bin 
or most often R+ or N. either R° or R or Z, 


C.1.2 

Let us begin by stating Kolmogorov’s extension theorem for product 

spaces, which is the main tool for constructing random fields with given 

finite-dimensional distributions. Let J be an infinite index set and for each 

ie J let E' be a Polish space with Borel o-field €. For sets F c I we 
denote by E™ the product space [],.- E and by # ® the product o-field 
Ier €". For sets I d GD F#Q, let TE denote the canonical projec- 
tion from E onto EP, Suppose that for every finite set F we are given 
a probability measure Pr on (EP, &) such that for any two finite sets 
G D F 4# Ø ,Pr=nx%.Pg. Then there exists a unique probability measure 
P on (E”, Æ) such that Pr = n}+P for all finite F4 Ø . 


C.13 

Let (E, € ) be a Polish space with its Borel o-field and let J be a set. We | 
write E’ for the product space [],.;£’, where E' = E for all i e J, and 
&®! for the related product o-field. A random process X with state space 
(E, & ) and parameter set J is a collection 


X=O-7er 


of measurable functions from a probability space (Q,.4, P) to (E, €). 
We can view X globally as a measurable map 


X : (9,4, P) = (E!, g9). 


Let Zy = X.P be the law of X. In this setup the canonical process asso- 
ciated with X is the process of canonical projections (Y;;i € I) defined on 


(E', a, Lx) by 


Y;(w) = w(i). 


C.1.4 

If X = (X;;ie Nand Z= (Zj;i€ | 
set Z, the same state space (E, 4 ) and same proba 
we say that X and Z are modifications of each other if 


) are processes with the same parameter 
bility space (Q,. é, P), 


Vie I, P(X; = Zi) = l. 


> 
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However, the right concept of two random processes bein 


. th . 
the following one: X and Z are indistinguishable if ome Same I 


P(Vi = I, X; = Zp = |. 


A subset A of Ix Q is said to b j is indistinon: 
e evanescent if 14 is indistinguish 
the zero process. sushable from 


C.2 Markov Chains 


| 

| 
C2.1 | 
Let (Q, F ) and (E, €) be measurable spaces. A Markov kernel from | 
(Q, F) into (E, &) is a mapping N : Qx € — [0,1] that satisfies the 
following properties: 


(i) for fixed œ e Q,C ~> N(@,C) isa probability measure on (E, #); 
(ii) for fixed Ce &,0 ~> N(@,C) is .7=measurable. 


To define a sub-Markov kernel we replace (i) by (i^): 
(i) for fixed œ € Q, C ~> N(@,C) is a measure on (E, &). 


C22 | 
If (Q,.A%) = (E, &), a Markov (resp. sub-Markov) kernel from (E, &) 
into itself is called a Markov (resp. sub-Markov) kernel on (E, & ). When 
dealing with a sub-Markov kernel P, it is convenient to adjoin an extra 
point 6 to E, to produce an extended space E’ = E U {6} with o-field 
&’=-~EU{BU {6}: Be &}. We set 


P(X, 10j)2 LPG) et: | 
P(6,A) = 14(8) Ae &’ | 


and the sub-Markov kernel P extends to a Markov kernel, which we also 
denote by P. The extra point 6 is called a coffin state for P. 


C.2.3 ee | 
Let P be a sub-Markov kernel on (E, # ) and let x be a probability measure 


in 
on (E’, &’). A sequence of random variables X = Ans 71 a Ee enh 
values in Z’ is called a Markov chain with transition function P and i 


measure T if 
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(i) the distribution of Xo is x; 
(ii) for n 2 1 and Be &’, 


P(Xns € B|o(Xo, ss) Ân)) = P(Xn € B|o(Xn)) = P(Xn, B) P — a.s. 


eor all finite-dimensional distributions of X are uniquely deter- 

Now suppose that (E, Æ) is a Polish space with its Borel o-field. We 
may use Kolmogorov’s extension theorem to get, for any sub-Markov 
kernel P and any probability measure x on (E’, €”), a probability measure 
P on (EN, (&’)®) such that the coordinate mappings (X,,n > 0) are a 
Markov chain with transition function P and initial measure n. If P is 
fixed, this probability measure is denoted by PA; we will write P, when 
m is the Dirac measure 6,. The random time C = inf{n > 0 : X, = 8} is 
called the lifetime of X, and 


P(@ : X;,(@) = à, Vn = C(w)) = 1. 


C.2.4 
In the setup defined above, we define the shift operator 8 on Q = Ey 


by the equation 


O(x0, X1, oe .) = (x1, X2, .. J 


for every random variable n, we define the random variables 8’n,n 2 0 


by putting 
(8”n)(&).= n(8”"(w)). 


y of the chain X can be given the following form: for 


The Markov propert ou 
every Be 6, 


every random variable n, every 12 0, and 
P(O" € B\O(Xo, - -»Xn)) = Py, (1 € B) Pr — as. 


j PLC alu: at x = Xn 
where the right-hand side is the function P,(n € B) evaluated at x n 
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C.3 Ergodic Theory 

C.3.1 

Let (Q,. 4,2) be a probability space. A measurable function 9 

(Q, £) — (Q,.4) is called a measure-preserving transformatio k 

P(67'A) = P(A) for every A in #4. A set A € Æ is almost ine : 

for @ if PCA |A) = 0. A measure-preserving transformation js Se 

ergodic if for every almost invariant set A we have P(A) = 0 or 1. es 

called mixing if for every A and Be .#, Is 


lim P(A N 0B) = P(A)P(B). 


Every mixing transformation is ergodic. 


C22 
Let @ be an ergodic transformation and let § be a random variable in 


L}(Q,.4, P). Then 


1 n—1 
im — j=E — a.s. 
lim : D eo) £ P- a.s 


k=0 
This is Birkhoff’s pointwise ergodic theorem. 


C33 
Let (Q,.4,P,®) be a Polish space with its Borel o-field, a probability 


measure P and a one-to-one measure-preserving transformation 6. Sup- 
pose also that the set of points œ in Q such that there exists an integer 
n > 1 with 0”9 = œ has measure zero. Then (Rohlin-Halmos lemma), for 
every € > 0 and every positive integer n there exists a set A E . @ such 


that 


(i) the sets A,0A,...,0" 'A are disjoint; 
(ii) P(U%,'0'A) > 1 — €. 


C.4 Discrete Parameter Martingales 


C.4.1 

Let (Q,.4, P) be a probability space and let (.F,,n 2 0) be an increasine 
sequence of sub-o-fields of . Z. The sequence F = (4n," 2 0) 1s calle 
a filtration. Any sequence of random variables {Z,,n 2 0} satisfying 
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(i) Zn is F,-measurable ne N 
(ii) E|Z,| <0 nen 
(ii) Ela) Fn) = Zm Pas. me n,m,ne N 


is said to be a (.7,)-martingale. Eau; _ 
martingale. Sare. Equivalently we say (Zn, Fnin=0) isa 
If (i) and (ii) are retained and (iii) is replaced by the inequality 


E(Zn| Fm) 2 Zm P-as. 


(resp. Eal Fn) S Zn P-—a.s.), then (Z,,.F%n > 0) is called a sub- 
martingale (resp., a supermartingale). 


C.4.2 
= (Zn, nin > 0) be a martingale (or submartingale, or supermartingale). 


sup E|Z,,| < 29; 
n 


then (Doob’s martingale convergence theorem) as n — ~,Z, converges 
P-a.s. to a limit Z.. with E|Z..| < oo, 


C.4.3 
Suppose that 1 < p < œ and p’ +q = 1. Let (M, Fin > 0) be a 
martingale such that 


sup ||Mnl|p < ce. 
n 


Then (Doob’s inequality) 
|| sup M, 1], < 4 sup ||Mnllp. 
n n 


Another useful inequality for martingales is the Burkholder inequality 
1 < p < œ, there exists an absolute constant C(p) such that 


pP 


o 1/2 
ll sup |Mn sal Molllp S C(p) (Sw ~ M) 


n-| 


b 
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C.4.4 = 
An (.F%,,)-stopping time T is a map T : Q —N=NU {+} with 


fo: T(@) <n} € Fn, neN. 


If (Zn, Fin = 0) is a nonnegative supermartingale, we put 


ZT = D Zn1{T-n) 
n=0 


and we have 


E(Zo) > E(Zr) 2 0. | 


C.5 Continuous Martingales | 


C.5.1 | 
We suppose now that the index set is R, (or a finite interval [0, T]). We | 


| 
1 


shall only deal with processes that have right-continuous paths, that is 
t ~> Z,(@) is right-continuous from R, to R¢ for every @ in Q. Two pro- 
cesses with right-continuous paths that are modifications of each other are 
in fact indistinguishable and will be identified. Let (Q, 76, P) be a prob- 
ability space endowed with a filtration (F,,t > 0), that is, an increasing 
family of sub-o-fields of 4. We define, for t 2 0, 


A filtration (72) is right-continuous if F; = F for any t 2 0. A d- 
dimensional random process Z = (Z;,t 2 0) is called adapted if Z; 1 
._F,-measurable for every t 2 0,. 


5.2 
A real-valued process Z carried by (,. 4, P) is called an (.F,)-martingale 
if 


(i) Z is an (75)-adapted random process; 
(ii) E|Z,| < 0, t2 0; 
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To get an (71)-submartingale (resp., (7;)-supermartingale) we h 
replace the equality in (iii) by the inequality > (resp., <). Recall ap . 
paths are assumed to be right-continuous. This right-continuity Fa i 
to transfer some theorems directly from discrete parameter martin e 
to continuous parameter ones. This is the case for Doob’s PA 
theorem: If sup, E|Z,;| < œ and Z is a martingale (or submartingale, or 
supermartingale), then Z; converges a.s. as t > œ% to Z.., and E|Z | < oo 
This is also the case for Doob’s inequality: If 1 < p < œ% p +q! z 1 and 
M is a martingale such that sup, ||M;||, < œ, then | 


|| sup |MrlIlp < q sup ||Mzllp. 
t 


C.5.3 
A real-valued process A is increasing if it is (¥;)-adapted, has Ap = 0 


a.s., and for P-almost every @ in Q, the path t ~> A,(@) is nondecreasing. 
With any increasing process A we can associate a random variable A% : 


Q — R, U {+} defined by 


Ac = lim A; P-as. 


to 


A finite variation process is the difference of two increasing processes. 


C.5.4 | 
Let M be a martingale with continuous paths such that for any ¢ in 


R,, E|M,|* < œ. We say that M is a continuous square integrable martin- 
gale. We define the quadratic variation process (M), to be the unique con- 
tinuous increasing process A that makes M 2 — A; a continuous martingale. 
Its existence is ensured by the Doob-Meyer decomposition of submartin- 
gales. If M and N are both continuous square integrable martingales, the 


quadratic covariation process of M and N is 
(M,N) = (M +N- (M-N). 


M,N) is a continuous martingale. We can 
Burkholder inequality: If 1 $ p < % 
hat for any continuous square 


It is easily seen that MN — ( 
now state the continuous parameter 
there exists an absolute constant c(p) such t 


integrable martingale M, 
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| sup |M; — Molllp < c(p) [](M)o0)'/" ln. 
t 


CSS 
An (F;)-stopping time is a map T : Q — [0, ce] such that 


{o:T(@)<t}e A t>0. 


Let X be a d-dimensional continuous adapted process and let F be a closed 
set in R. If we set 


Dr(@) = inf{t >0:X,(w) € F} 


with the usual convention that inf Ø = +c, then Dr is an (.Y;)-stopping 
time. Dr is called the debut or the entrance time of F. If X is right-con- 
tinuous adapted and G is an open set in Rf, the time 


Tc(@) = inf {t > 0: X,(@) € G} 


is a stopping time with respect to the filtration (Y;,). It is called the hitting 
time of G. Every constant c € [0,°°] is a stopping time. If S and T are 
stopping times, so is SAT. 


C.5.6 
A d-dimensional process X = (X;;t 2 0) carried by (Q,. , P, (.F;)) Is 
called progressively measurable if, for every t 2 0, the map 


(s,@) > X5(@) 


from [0,1] x Q to Rd is, #([0, 1) ®.F,-measurable. A right-continuous 
(Z,)-adapted process X is progressively measurable. Let 7 be an (41) 
stopping time. We define the o-field .77 by 


By =(Ae. @:AN(T St) € F; for all 120). 


wr ‘ ‘ # de «© , rt om 
If X = (X,;t > O) is progressively measurable and Xoo IS an) rand 


variable, then 
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XT(Q) = X7(@)(@)1 T<) (0) + Xoo(W) 1 {7-20} (0) 
is an .77-measurable random variable. 


Cat 
Let M be a (right-continuous) (7,)-martingale. If S and T are bounded 
(7,)-stopping times with S < T, then (optional sampling theorem for mar- 


tingales) 


- E[Mr| Fs] = Ms Pas, 


CAS 
A real-valued (right-continuous) adapted process X is an (.¥;)-local mar- 


tingale if there exist (.Y;)-stopping times T; < T2 <--- with T, — P-a.s. 
as n — œ such that for every n > 1 the stopped process 


i 
X; "= XtrTn 


is an (.Y;)-martingale. If X is a continuous local martingale, there exists 
a unique continuous increasing process A; such that X? — X6 — A; is a 
continuous local martingale. It is called the quadratic variation process 
of X and denoted by (X),. This definition is consistent with the one that 
we have given for continuous square integrable martingales. If X and Y 
are both continuous local martingales, we still define 


(x, Y)= 10K +) - &- ¥)). 


C.5.9 
Fort e R°,a partition T; of (0, t] is a finite ordered subset T; = {to,..., tk} 


of [0, s] such that 0 = fo < ty < ++ < tk = t. The mesh of T; will be denoted 
by 


(x) = max {|t — Hi = 9,...,4 - 1). 
> 0} be a 


Baa 


Suppose M is a continuous local martingale and let {m,n 
sequence of partitions of [0,f] such that lim, S(r’) = 0. Then 


B 


PRELIMIN gy, 
2 
> (Mr — Mn) 
j+l J 
J 


converges in probability to (M), as n —> ©. Consequently a “ONtinugy, 
local martingale has a.s. infinite variation In any interval [s, t] with (M) 
(M), a.s. 


C.6 Stochastic Integral 


C.6.1 


If the o-field . € is P-complete and each .¥; contains all P-null sets of | 


€, then the filtration (.7) is said to be P-complete. If M is a contin. 
uous local martingale, L7,.(M) is the space of progressively measurable 


processes H such that, for every t = 0, 


e(f Hd M) z =) =] 


If (7;) is P-complete and H € L} (M), there exists a unique continuous 
local martingale denoted (H.M), or fo H;dMy such that for any continuous 
local martingale N, 


| 
| 
| 
| 


(H.M, N) = H{M,N) 


where the right-hand side denotes the Stieltjes pathwise integral of the 
process H with respect to the finite variation process (M, NY. The process 
H.M is called the stochastic integral of H with respect to M. 


6.2 
A real-valued random process X is a 
be written X = M +A, where M is 
is a continuous finite variation proce 
quadratic variation process of X is 


continuous semi-martingale if it can 
a continuous local martingale and A | 
ss; this decomposition is unique. The 


(X) = (M) 


and the quadratic covariation process of two continuous semi-martingales 
X=M+A and Y=-N+Bis 


(X, Y) = (M,N). 
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C.6.3 

A progressively measurable process H is locally bounded if there exists 
a sequence of (.¥;)-stopping times T, < T» < --- with Ty — œP-a.s. as 
n —> œ and constants C, such that the stopped processes H!™ satisfy 


IHP|<C,, n=l. 


All continuous adapted processes H are locally bounded, and locally 
bounded processes are in L7,.(M) for any continuous local martingale M. 
Suppose that (7) is P-complete, and let H be a locally bounded pro- 
cess and X = M +A be a continuous semi-martingale; then the stochastic 


integral of H with respect to X is the continuous semi-martingale 


A.X=HM+H.A, 


where H.M is the stochastic integral in C.6.1 and H.A is a Stieltjes path- 
wise integral. We will also write (H.X), = f H;dX.. 


C.6.4 
If X and Y are two continuous semi-martingales, then (integration by parts 
formula) 


t 
X,dY, + | Y,dX, +X, Yð. 
0 


t 


X¥;=Xo¥o+| 
0 


This equation is a particular case of Ito's formula (chain rule for stochastic 
integrals): Let F € FR; R) and let X = (X!,...,X%) be a continuous d- 
dimensional semi-martingale; then F(X) is a continuous semi-martingale 
and 


t OF | t F Po 
= blade ar ees Saint ENS 
F(X,) = F(Xo) + > | weta ZI ga ŽO XX) 


C.6.5 
Let X and Y be continuous semi-martingales. The Stratonovich stochastic 


integral is defined by 
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. t L(X,Yh. 
[x O ay, [xd A 


: -di sional Semi-mart: 
hou xd) is a d-dimen Mi-Martin. 


BZR. X 
jf F e ER; R) and nal chain rule: 


gale, then we have the us 


£ OF i 
Det dxX,. 
raz Fe) + | ox; (Xs) © 
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D.1 Brownian Motion 


D.1.1 | l | 
Let (Q,.4, P) be a probability space. A d-dimensional Brownian Motion 
(B;,t 20) isa d-dimensional random process that has the following prop- 


erties: 


(i) (independent increments) if O < tọ < H°:° < fn, then B,,, Br - 
Bigs Btn — Btn- are independent; 
(ii) if s,t 2 0, then Bus — Bs has distribution N4(0, 1), where I is the 
identity matrix; 
(iii) with probability 1, £ ~> B; is continuous. 


If moreover Bo = 0 P-a.s., then the Brownian motion is said to be standard. 
If Bo = x P-a.s., where x e R? , then it is said to start at x. 


D.1.2 


If B is a standard d-dimensional Brownian motion, then the following 
properties hold: 


(i) (Homogeneity). For any s > 0, the process (B,,, — B,,t > 0) is a 
standard Brownian motion independent of o(B,,u < s). 


(ii) (S ; . i 
scape The process (—B,,t > 0) is a standard Brownian 
(iii) (Scaling). For eve 


ry c > 0, the pr 
Brownian motion. Process (cB,,2,1 > 0) is a standard 
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D.1.3 
In some cases, the interesting filtration will be larger than (0(B,, u < t), t > 


0), and we shall adapt our definition. A d-dimensional (.¥;)-Brownian 
motion is an R‘-valued random process that has the following properties: 


(i) (B;,t 2 0) is an (.Y;)-adapted process; 
(ii) if s,¢ 2 0, then B;+s— Bs is independent of .Y, and has distribution 
Na(O, fl); 
(iii) with probability 1,1 ~> B, is continuous. 


In the above definition, we may replace (ii) by the equivalent condition: 
Elexp {i6, Bus — B,)|F,}] = exp {-2|E|?/2} P- a.s. 


for all E € Rf and s,t > 0. 

D.1.4 

A Gaussian process with index set T is a collection of real random vari- 
ables (X;;t € T ) whose finite-dimensional distributions are all Gaussian. 
A real Brownian motion starting at x is a Gaussian process with moments 


E(B) =x t20 
Cov (B;,B;)=s at sS,t 20. 


If @ is a bounded real Borel function and t > 0, we have 


E(B) = | Ppt; x, y) dy 
where 


p(t; x, y) = Qnty? exp {-(x - y)?/2t} 
is the transition-density function of the Brownian motion. 
D.1.5 
Another important Gaussian process is the centered Gaussian process 


defined on T = [0, 1] with covariance 


Cov(X,,X,) = s(1 — t) if O<s<t<l. 


P 
28 RELIMIN a 
The easiest way to get such a process 1s to set 


X, = B; — tBi, Osrstl, 


where B is a standard one-dimensional Brownian motion. This immedi. 
ately gives us a continuous version of the process, which is called a Brow. 
nian bridge. More generally, one may consider, for x, y in R, 


XF” =x+B,+t(y—x— Bi). 


This process is called a pinned Brownian motion. It is easy to see that if 


P1,...; On, W are bounded Borel functions on R, and O < fi < +- < ta < 1, 
then 


Elo Œ + Ba) -pA(x + Ba Wx + B:)] 
2 [Et Pa XZVL; x, y) dy. 


In some sense, the law of X*” coincides with the law of the Brownian 
motion starting at x and conditioned on being at y at time 1. 


D.1.6 
A third useful Gaussian process is obtained in the following way. Let B 


be a standard one-dimensional Brownian motion. The process with index 
set R defined by 


V; = e Ben fe R, 


is called an Ornstein-Uhlenbeck process. It is a Gaussian process with zero 
mean and covariance 


E(V,V,) = eS], ste R. 


The probability distribution of V, is N(0, 1) for any t € R and for any 
bounded Borel functions @ and won R, we have, for s + t, 


ROUTES f como: 0, Dt = six dx dy 
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where, for u # 0, q(u; x, .) is the density function of the probability distri 
bution N(e7!#lx, 1-e? aly. probability distri- 


D.1.7 
Let B be a standard d-dimensional (F;)-Brownian motion. The following 


processes are (.7;)-martingales: 


(i) By ie {1,...,d} 
Gi) BB,-S8it ije{l,..,d) 
(iii) exp{(6, B;) — 1102} 0e R*. 
A nontrivial result is that a continuous process B with P(B5=0)=1isa 


standard d-dimensional (7;)-Brownian motion if and only if (i) and (ii) 
hold. Property (iii) can be extended to the complex case: 


exp {i(u, By) + Llu?) 


is a complex (.7;)-martingale for every u in RÊ. 


D.1.8 
Properties (i) and (ii) above have two important consequences: 
SD = flore {lenad } and so almost every. Brownian path has 
unbounded variation on every interval; 


° if (7) is P-complete, H.B = >), H'.B' may be defined for any 


d-dimensional progressively measurable H such that 


t 
(| HPas<=) = 1 Vt >20. 
0 


D.1.9 
Suppose that the filtration (75) is P-complete, and let B be a standard 


d-dimensional (.F,)-Brownian motion. Let T > 0 and let H be a d-dimen- 
sional progressively measurable process satisfying 


T 
P| f Pas < =l. 
0 


PRÈL | 
30 MINA, 


For 0<1t</7, set 


d t t 
F TD. D 
zan=op {$ f Mid IA asl 
i=] 


As a result of Ito’s formula, the process (Z;(H); 0 = 1 S T) is à One. 
dimensional continuous local martingale with Zo(Ħ) = 1. Itis a martingal, 


if and only if 
E[Zr(A)] = 1. 


In this case, we define a probability measure Pr on (Q, Fr) by | 


Pr(A) = EllaZr(H),, Ae Fr 


and the process B = (B!,..., B2) defined by 
~ . t . 

Bi = Bi- | Hias l<i<d, O<StST | 

0 | 


is a standard d-dimensional Brownian motion carried by (Q, Fr, Pr) 
(Cameron—Martin-Girsanov theorem). A sufficient condition for Z(H) to 


be a martingale is Novikov’s condition: 


| 
| 


T 
Efexp 4 |l |H;|7ds] < oo, 
0 


D.1.10 
In many electronic or mechanical systems, the sensitivity of the devices 


is limited by continual fluctuations, which we call noise. Thus it would 
be interesting to represent it by a centered Gaussian process with covari- 
ance 14,4. For such a process, any two random variables X,,X, with 
s # t would be independent. Now, there is no such measurable map 
(t, ©) ~> X,(@) since, by Fubini’s theorem, 


nn emma so 
PATAT SASADA 
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t 
i= [x as 
0 


would be a centered Gaussian random variable with E(Y?) = 0, and hence 
X, = 0A, @ P-a.e.; on the other hand, 


t 
elj xias a 
0 


which gives a contradiction. Such a process X may, however, be treated 
as a functional that acts on functions f in Lj,.(R4) 


| f(s)X, ds = | F(s)dB,, 
0 0 


where B is a standard Brownian motion. The process X is called the white 
noise process. It is the derivative of B in the sense of Schwartz distribu- 
tions. 


D.1.11 

Any continuous local martingale whose quadratic variation process tends 
to infinity can be time changed to a Brownian motion. We state the mul- 
tivariate version of this property, which is Knight's theorem. Suppose 
that (7) is a right-continuous filtration and let M = (M 1 ...,M®%) be a 
d-dimensional continuous local martingale with 


Mi=0 P-as. L= ree | 
(M'),,=00 Pas. i=1,...,d 
(M',M’),=0 P-as. 1<i4j<d, O<t<. 
Let 


Ti(s) =inf{t>0:(M'),>s} OSs<0, 1<i<d. 


Then the process B = (B',...,B“) defined by 
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B; = Mois) 


i ‘an motion. 
is a standard g-dimensional Brownian 


D.2 Wiener Space 
] continuous functions w : R} — Rd We 


PE . RÊ) be the set of al 
à a metric on (Rs; R4) by 


define a metric on 


p(w1, W2) = >)? “sup |wi (t) — w(t)|) A 1] wi, w2 € ER; Rd) 
n=1 RS | 


This space is complete and separable under the metric p. Let m= {T;;t > 
0} be the collection of canoni 


cal projections 


rw) = w(t), re R, we ER; R?). 


If. denotes the Borel o-field on &(R:; Rd), then on) = B, and every 
probability measure on (& (R4; R2), Ø ) is uniquely determined by its 


values on the sets of form: 


{w : (w(t}), . ..>W(tn)) E E}, 
where 0 < t; <---<t, and E e BR"), 


D.2.2 
Let B be a d-dimensional Brownian motion starting at x € Rf. As B 


À P -a.s. a mapping from Q into &(R,; Rf), we may consider its law 
A = i a which does not depend on the particular Brownian motion B. 
: sii : = Heed measure started at x or, if x = 0, shortly the Wiener 
ure. The ili . Rd ig , 
space. probability space (&(R;; R°), Z, Po) is called the Wiener 


D.2.3 

In some cases it wi . 

(0, 1] and pe oa more convenient to restrict the parameter set to 

restrict to the closed ed Soun and the Borel o-field JP, or even to 
sed subset &9([0, 1]; RI) of continuous functions that 


have w(0) = 0. Th 
=U. The Cameron— . i ; 
uous functions n-Martin space is the Hilbert space of contin- 
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which can be written 


FO= | f(s) ds, O0<t<l, 
0 


for some f in L2([0, 1]; RÔ). It has symmetric form 


1 . 
(f, 8)H = [ (s), g(s))ds. 


The Cameron—Martin space is a dense subset of @((0, 1]; Rd). 


D.2.4 

Now suppose that we are working in the canonical setting : the probability 
space (Q, Æ) is (ER: RI, B) and for any probability measure P, 
on (Q,.4), the process of canonical mappings (B;,t > 0) is a Brownian 
motion starting at x. For s 2 0 we define the shift transformation ®, by 
setting 


(8,@)(t) = os + t), t= 0. 
The canonical filtration on Q is {.7, = o(B;;s < t);t > 0} and we have the 
following Markov property: If s 2 0 and Y is a bounded random variable, 


then for all x e R4, 


EL 0 0l F] = Eg, Y P,- a.s., 
where the right-hand side is the function (x) = EY evaluated at x = B,. 
The following consequence is particularly useful: Let F be a closed set in 
Rf, let Z be a bounded F -measurable random variable, and let g be a 
bounded Borel function on R£. If Dr denotes the debut of F, then 

Drp = s + Df 0 9, on {Dr>s} 


and 


E,[Zg(Bp;)1 (Dr>s)l = ExTZ lip) EB, [e(Bpp)]]. 


CHAPTER 2 


Computation of Expectations in 
Finite Dimension 


In this chapter we lay out the mathematical framework for studying Monte 
Carlo and quasi-Monte Carlo methods, which are used to calculate the 
expectation of random variables defined on a finite dimensional space. 
We also consider the numerical computation of conditional expectations 


A MATHEMATICAL FRAMEWORK OF SIMULATION 


There is probably no mathematical subject which shares with probability the fea- 
tures that on the one hand many of its most elementary theorems are based on 
rather deep mathematics, and that on the other hand many of its most advanced 
theorems are known and understood by many (statisticians and others) without 
the mathematical background to understand their proofs. It follows that any rig- 
orous treatment of probability is threatened with the charge that it is absurdly 
overloaded with extraneous mathematics. [J. L. Doob, 1952] 


A.1 Narrow Convergence 


We begin by summarizing some facts about narrow convergence. 
Let E be a Polish space (a complete separable metric space), and let 
& be the o-field generated by the open sets in E. We call # the Borel 


o-field on E. 


A.1.1 Examples | | 
The sets [0,1], {0,1}*,fors € N*,{0, 1]§, {0, 1 }N, all endowed with their 


product topologies, are Polish spaces and compact Hausdorff spaces. R 
and R° are locally compact Hausdorff spaces. For d € N°, ZR, R ) 
is a Polish space that is an infinite-dimensional Fréchet space (complete 
metrizable locally convex space), 
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rune sire On the set 4 \(E, &) of probability measures on 
(E, €), ow topology is the coarsest topology for which the map- 


pings: 4 œ> u(f) are continuous for all f i 2 
continuous functions on (E, #). f in the set @4(E) of bounded 


A.1.3 Exercise 
Let (Un)n2z1 be a sequence in ZZ ne 
(Un(f))n>1 iS Convergent for every bou 
is some u in .4!(E, €) such that u 
continuous bounded f. 


€). Suppose that the sequence 
nded function f. Show that there 
n(f) converges to u(f) for every 


A.1.4 Exercise 


Show that there exists a countable subset A of C (E) such that, if 
Un, U E -C6\(E,&) and u,(f) > w(f) for all f in ZE, then yn > u 


narrowly. We call it a convergence-determining set. 


Definition A.1.5 A subset 4 of .Æ1(E, &) is called tight if, for 


every € > Q, there exists a compact subset K of E such that U(E\K) < € 
for every u in Æ. | 


Theorem A.1.6 (Prokhorov) Let .-# be a subset of .44!(E, & ). The 
following statements are equivalent: 


(i) every sequence of elements of .~@ contains a convergent subse- 
quence, 
(ii) Æ is tight, 
(iii) .@ is relatively compact. 


A2 Narrow Convergence and u-Riemann Integrable Functions 


Definition A.2.1 Let u be a probability measure on (E, &). A real 
bounded Borel function f on E is called -Riemann integrable if, for every 
€ > 0, there exist continuous bounded functions u and v on E such that 
u <f < v and |(v — u)dp < €. If f is .724-valued, then f is said to 
be u-Riemann integrable if its components are real u-Riemann integrable 
functions. 


A.2.2 Exercise a p 
Let f be a Borel bounded function from (E, & ) into .#2%. Prove that F 


. » A d 
is -Riemann integrable iff for every € > 0, there exists g in CE, 4") 
and h in (E) such that 
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Theorem A.2.3 Let (U,),>; be a sequence of probability measures on 
(E, E ), which converges narrowly to a limit u. Then | fdu, > [fdu for 
every -Riemann integrable function f. 


Proof We may assume that f is a real function. Given € > 0, let ü ve 
Cp(E) satisfy u < f < v and f(v — u)du < €. Then = 


| ua, < ffau < vam 


The left- and right-hand terms converge to f udu and f vdu respectively, 
so 


Tim [du — lim [fay < € 
Since € is arbitrary, we deduce that | f du, converges; and from the inequal- 
ities we see that the limit is ff dy. 


Theorem A.2.4 Let f be a real bounded Borel function on (E, &). 
Then f is u-Riemann integrable iff the set of its points of discontinuity is 


u-negligible. 


Proof Let us introduce the lower and upper regularized functions: 
f(x)=lim f(y) and f(x) = lim f(y). 
x yx 


Then f ss f , and f is continuous iff f = f . We note that f is upper semi- 
continuous (usc) bounded and f is lower semi-continuous (Isc) bounded. 
(a) Suppose that the set of points of discontinuity of f is u-negligible. 


Then 
fim- frau- fiau 


Since f is lsc bounded and f is usc bounded, there exist sequences 
(Pr), (Yn) in Z(E) such that @, T f and y, 4 f. Therefore, 
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Jour fian- fray 


and 


J vnc à fan. [ra 


We deduce that f is u-Riemann integrable 

(b) Conversely, suppose that f is u-Riemann integrabl 
let u and v be bounded continuous functions such fies i 
|v - u) du £ €. From the continuity of u and v we have ‘ 


Given € > 0. 
<f < y and 
u<f<f<f<v, 


and therefore 


fé -Pau<e 
Since € is arbitrary, we deduce that u{x: I (x) < f (xX)}= 0: LJ 
A.2.5 Examples 
(1) Let (X,) be a sequence of i.i.d. Rd-valued random variables such that 


E|X,|? < œ, and let 


m= EX, 
and 
D = E[(X, - m)'(% - m)l. 


Let Sra Xp te: + Xn and u = Na(0, D). Then 


Sn — nm 
Joe e Ab OMA), 
nfs ed 


i igi £ besgue-negli- 
for every Borel set A € R4 with p-negligible boundary (Lebese 


gible if the matrix D is regular). 


yp 
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(2) Let (X,),>1 be a sequence of R-valued random variables. For x - 
(xı, see Xd), let 


d 
Fx, (x) = e( BEL: <x) 


i=1 


be the distribution function of X,. If (X,) converges in law to X (i.e. 
P oX,! converges narrowly to P o X~'), then (Fx,) converges to Fy at 
each continuity point x of Fy. 


A.2.6 Exercise 
Show that, conversely, if (Fx,) converges to Fy at each point x in a count- 
able dense set in Rf, then (Xn) converges in law to X. 


A.3 Application to Simulation 


There is a well known mathematical device to map a square onto a line segment 
in such a way that area goes into length. All we need to do is to write our 
coordinates in the square in the decimal form: 


A = 070) 420d B = Pi Bis <4 Bye ss 
and to put 
y= -01B102B2...OnBn...; 


and we obtain a mapping of this sort which is one to one for almost all points 
both in the line-segment and in the square. [Norbert Wiener, 1947] 


The following result is sometimes called the fundamental theorem of sim- 
ulation. 


Theorem A.3.1 Let E be a Polish space with Borel o-field 4. For 
every probability measure u on (E, & ) and for every s in N*, there exists a 
Borel function f on [0, 1]° such that f is A,-a.e. continuous and pt = Asof K | 

Here À, denotes the s-dimensional Lebesgue measure on [0, 1}. We 
state a preliminary lemma before proving the theorem. 


Lemma A.3.2 There exist Borel functions A: [0,1] — [0,1]" and 
k: [0, 1] — [0, 1] such that ho k = Id outside a countable set A c [0, 1> 
and A is continuous outside a countable set B c [0, 1]. 
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Proof of the Lemma We define the Mappings 
o hı: [0,1] — {0,1} 

e Mm: {0,1}% — ({0, 1}N)N 

o h3: ({0, 1} ) — [0, 17N 


by 


e (H()}h=1 VaneN 


o (hi&)n=qn Vne Nifx<] where i 
i y: s (qn) is the canonical bi - 
pansion 5 — 2. (Qn/27), with the D Er Card {n: 2 = ee 
e hz is induced by any one-to-one mapping from N onto Nx N 
° h3((qm,n)m,nz0) = re (Ginn 27) once: 


In the same way, we define the mappings 
© ky: {0, LA -=> [0, 1] 


e ko: ((0,1}N)N > {0, 1} 
e kz: [0, 1] — ({0, 139 


by 


© k3((Xn)nz0) = (Ai Xn))nz0 
e k = h;' 
© ki ((dn)n>0) = eer (ga): 


The assertion of the lemma is now easily verified by setting: 
e h = h3 O M O hy 


e k= kı O kz O k3 
e A=(k20 k3) !((h1([0, 1]))°) 


and taking B to be the set of dyadic rationals in [0, 1]. 


We can now prove Theorem A.3.1. 


Proof of the Theorem 
know [cf. Bourbaki, Topologie 
Polish space is homeomorphic to 
assume that 


It is enough to consider the case s = 1. We 
Générale, Chapter 9, paragraph 6] that a 
a Borel subset of [0, 1]N, so-we may 


J 
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(E, €) = (0, 1]*,.A((0, 1]%)). 


If h,k,A, and B are as above, let us define a new countable set 
C=AU h(k(A‘) N B). 


This satisfies k(C°) N B = Ø and h o k = Id outside C. Let U be . 
probability measure on [0, 1]N; we decompose u as . 


= lx -U + lc.u. 


The measure v = 14.1 can be simulated in the following way: for y € 
[0, u(C°)], let 


u(x) = inf {y e [0,1]:v o k '({0, y]) ex}. 
It is well known that, for every Borel set D in [0, 1], 


Au '(D)) = v(k-'(D)), 


and in particular 
Ai (u~ (B)) = v(k-"(B)) = w(C* N k~! (B)) = 0. 


Since u is non decreasing, fı = Aou is A,-a.e. continuous, and A; of;! =V 
on C°. Now 1c.u is supported by a countable set and easily simulated by 
a mapping f from ]Ju(C°), 1] into C that takes constant values on intervals 
with suitable lengths. The function f, defined to be fı on [0,u(C‘)] and 
f2 on Ju(C*), 1], has the required properties, so we are done. O 


A.4 Distribution Functions | 
For x = (x1,...,x,) and y = (y1, ..., Ys) in R°, we write x < y tee Ay 
We write [[x, y]] for the the s-dimensional interval []*_, [x;, yi] in R’, while 
]]x, y]] denotes [];_, ]x;, yi]. This last notation will also be used when one 


or several of the x;’s are —oo, o- 
Let u be a probability measure on (R°,.#(R°)) with distribution fun 


tion 
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Fast co, x]]), xe RS, 


Lemma A.4.1 


Fu 1s nonnegative, nondecreasing in each variable and 
right-continuous by upper sector: if g = (Oy,..., Œs) and 
(*) X = is... Xs), 
limy Lou xs Lo, Fax) = Fu(o), 
HIM, Too. x Poo Fu (x) = 1 
4 PTS FuG)=0 forevery i=1,...,s. 


By setting, fora < b, 
eE a = 
(**) AO ER E Hasse gee) = PRG EA WE oo Rel, 
foray < bisina <br, we get 
[++ ME Wre a je 20. 


The proof is straightforward. 


Lemma A.4.2 Conversely,.if a function F from R° into [0, 1] satisfies 
(*) and (**), then there exists a probability measure u on R° such that 
F = Fi 


Proof A probabilistic proof can be found in the books by L. Breiman 


[1968], E.J. McShane [1974] or A.N. Shiryayev [1984]. We give a proof 
that relies on the theory of Schwartz distributions. Let T be the distribution 


and let œ be a positive element of the space Z of test functions. Consider 
the convolution 


(œ * F)(x) = | a y)F(x — y) dy. 
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as | 


Then Jf, œ * T dA; > 0, and T is a positive measure. Suppose in addition 
that fo dA, = 1 and support (a) c ] — +, O[*, and define: | 


O,(x) = n‘o(nx). 
By (*), On * F J F as n > o, Using Fubini’s theorem, we get 


]JJ-%,x]] 


= Í IanO [au = y)T (dy) dz 


= Í T(dy) | L- x (u + Y)On(u) du 
= (Q, * T(] — ~, DE). 


The nondecreasing function Fr(x) = T(]] — +, x]]) is a.e. finite, and thus 
everywhere finite and right-continuous by upper sector. | 
From ©, * F = O, * Fr, we deduce that F = Fr by letting n tend to | 
E | 


infinity. 

Lemma A.4.3 Let (F,)h be a sequence of distribution functions 
on [0, 1]°, and let x be the distribution function n(x) = x; ---xs of the 
Lebesgue measure À, on [0, 1]5. If F, — x in L'([0, 1]°, As), then Fn > 7 
uniformly. 


Proof (a) Let F be the distribution function of a probability measure 
u on [0, 1] and let a € 10, 5[. If, for some b > 0 and some xo € [a, 1-a}, 


we have F(xo) < M(Xo) — b, then 


[irc = m(x)|dx 2 Jaw = (U(x) = b))* lioro) dx. 
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re get an analogous inequality if F (xo) > Txo) + b, and deduce that if 
; at i 


oe 1(xo)| 2 b for some b > 0 and 
Le C(a, b) > 0 such that some xo € [a, 1 — a}, then there 


| |F(x) — ™(x)|dx > C(a, b). 


Therefore, for every b > 0, the inequality 


[irc — W(x)|dx < C(a, b) 


implies that, for every x € [a, 1 — a], 


| |F (x) — rx) < b. 


(b) Let À = La, 1 — a)’ and assume |F - n| < € on A. If x; < a for some 
ie-il,:2:58}, then 


|F(x) — rx) < W(x) + F(x) < À (AT) + WA). 
Otherwise, put y = (x; A (1 — a),...,Xs A (1 — a)). Then y € A and 


|F(x) -T| < |E) — np) + Inra) - By) + Fœ) - FO) 
<€+A,(A°) + LA). 


An easy computation gives 


u(A) = u([0, 1 — a]*) — u([0, 1 — a}\A) 
<(1-aÿ -e-s[e+a(l-a)""], 


and we conclude that 
S 5 s—] 
IF = nl. < (2+5)e+(41-(41-2a))+(-(-a))+sa(l a). 


| | ; adi and (b). 
(c) The assertion of the lemma now follows readily from (a) 5 
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A.5 Uniformly Distributed Sequences and Discrepancy | 
Definition A.5.1 A se 
De quence § = (&,)n>1 in [0, 1F is Called yy; 
distributed (u.d.) in [0, 1]5 if for every X= (%1,...,X;5) in [0 ir PRE 


TRES 
van >II li SMA), 


n=1 i=] 


Th 
€ rate of convergence can be measured by several discrepancies: 


DŽ(&,N)= sup 
xe [0,1] 


T(x) — — Dy Lirox (On) | 


Ie. Xi) -FÈ mn) 


D.(§,N)= sup 


x,ye [0,1]Sx<y 


TX) — N > Lox (En) 


S 
$ 1 N 

l ixsy} (Te =x) = N > ns) 
Ei n=! 


It is easily checked that, for every C. 


DE, N) = 


1 <p < co, 
LP(dx dy) 


 <DNEN) <1 
2N al ) 


and 


DEN) < DE, N) < BDE(E,N). 


Theorem A.5.2 The following statements are equivalent: 


(i) The sequence € is u.d. in [0, 1}°. 
(ii) For every p in [1, °°], D5(6, N) > 0 as N > 00, 
(iii) There exists p in [1,°°] such that D* pit N) = 0 as N >. 


(iv) The probability measure uy = (1 IN) To |, converges narrowly 
to the Lebesgue measure A, as N — œ, 


\ 
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proof (i) = (ii). If § is u.d., the distribution function Fy of 
verges pointwise pr T. By Lebesgue’s theorem on dominated aa one 
Fw converges in L (dx), and also uniformly by Lemma A.4.3 hi 
äi) > (iv). It also follows from Lemma A.4.3 that if D*E N) > 0 
then Fn > © uniformly. Therefore f gdux — | gd, for every finite sum 
of indicator functions of s-dimensional intervals. Since every an 
function can be uniformly approximated by such functions, we get (iv) 
(iv) = (i). The boundaries of s-dimensional intervals have Lebesgue 
measure Zero, SO their indicator functions are Riemann integrable. 


Proposition A.5.3 (Hermann Weyl 1926) The following statement is 
also equivalent to conditions (i) to (iv) above: 


(v) For every m + (0,...,0) in Z5, 
1 À 
N > exp(2ir(m, En)) — 0 as N >œ. 
n=1 


Proof (iv) = (v). The functions x ~> exp(2in(m, x)) are bounded and 
continuous on [0, 1]. 

(v) = (iv). We have pn(f) — As(f) for every linear combination 
f of exponentials. These functions generate an algebra, which is closed 
under complex conjugation, contains the constant functions, and separates 
points in the s-dimensional torus T;. By the Stone—Weierstrass theorem, 
the assertion is true for every f in %p(Ts), for every Riemann integrable 
function on T,, and therefore for every bounded continuous function on 


[0, 1}°. 


A.5.4 Examples 

(1) Let 0),..., Os be irrational numbers such that l, Oj 5.205 Oy are lin- 
early independent over the rationals. If we denote by {x} the fractional 
part of the real number x, then the sequence ({no,},..., {mOs}) is u.d. in 
[0, 1]°. | 
(2) Let (pn)nen be the increasing sequence of prime numbers. For every 
irrational œ, the sequence ({Opn}) is u.d. in [0, 1]. 

(3) The sequence ({v/n}) is u.d. in [0, 1]. | 

(4) If œ is irrational, the sequence ({an*}, Lo(n+1)},..., {a(n+s— 1) }) 
is u.d. in [0, 1}. 

(5) For À,-a.e. 0 > 1, the sequence ({0"}) is u.d. in [0, 1} (Koksma). 

(6) For A,-a.e. k > 0, the sequence ({k(5)"}) is u.d. in [O, 1]. 
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A.5.5 Extension to u-Distribution 
The notion of uniform distribution can be extended to a more gen 
setting. Let u be a probability measure on a Polish space (E, gl 
sequence § = (€,) in E is called p1-distributed if the sequence of me 
ures (1/N) >, dz, converges narrowly to LL. ia 
For instance, inside every nonempty interval of R,, there exist 2X, 
numbers x such that ( {x(3)"}) is L-distributed, with u = [k/(k + DIR 
[1/(k + 1)]8o and k = log 2/ log 3 [G. Choquet (1)]. a 


B THE MONTE CARLO METHOD 


The basic idea of the Monte Carlo method is to use pseudo-random 
sequences to simulate random behavior and then apply the Strong law 
of large numbers in an appropriate fashion. Quasi-Monte Carlo methods, 
which we will study in Section C, use sequences that satisfy only: the 
weaker property of equidistribution. At the end of Section C, and later 
in Chapters 3 and 5, we will compare the different areas of application, 
rates of convergence, and stopping criteria for the different methods. 
Obtaining sequences of real numbers that imitate perfectly the behavior 
of random sequences is not only difficult, as can be seen by the history 
of the successive attempts [Dellacherie, 1978], but logically impossible 
[Martin-Lof, 1966]. Nevertheless, the art of constructing pseudo-random 
sequences is at now very advanced (see the references in Section C.5.3) 
and yields excellent generators which are generally easy to implement. 


Ich bin Zarathustra, der Gottlose: ich Koche mir noch jeden Zufall in meinem 
Topfe. Und erst, wenn er da gar gekocht ist, heisze ich ihn willkommen, als 
meine Speise.! [Friedrich Nietzsche, 1883] 


B.1 Algorithms for Simulation 

Before we can use the Monte Carlo method to compute the expectation of 
a random variable F, we must first define a random variable with the same 
law as F on the probability space ([0, 1]”,.27([0, 1]”), dx”) for some m 
in NU {+}. Simulation algorithms are techniques that yield a random 
variable with a given law on ([0, 1]”,. A (0, 11”), dx"),m e NU [te]: 
We can rephrase this in the following language: A Borel sequence 18 4 
sequence (U,),>; of independent random variables, where each Un takes 


+ e, I 
II am Zarathustra the godless: I cook every hazard in my pot, and when it is well done, 


welcome it as my dish. 
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values in [0, 1}° with law 4x. How 
of independent variables each with 

There are general procedures to solve this 

iti api robl ; 

sion, composition, and rejection methods, as well as ee as ee 
rithms for specific cases. It is beyond the pecialized algo- 
an account of these techniques for which 
Morgan [1984], Ripley [1987], and especial 
L. Devroye [1986]. pecially to the reference book of 


Let us emphasize the fact that these techniques, as we shall 
can also be used with quasi-Monte Carlo methods instead of the Monte 
Carlo method (as long as the copy of F is regular enough to be Riemann- 
integrable, see the discussion in Section C.5.3 and Chapter 3, Section A.2). 

We restrict our attention to the rejection method [Von Neumann, 1951], 
which is sometimes called the acceptance method. Let X be a random 
variable with values in R and density function f. Assume that we know 
another distribution function g, called the envelope, from which it is easy 
to sample, and that there exists k > 1 such that f < kg. Then the random 
variable X can be simulated in the following way. 


Can We obtain from (U 


n)n>1 a Sequence 
the same law as F? 


> 


see below, 


(i) We simulate a random variable H with density g. 


(ii) Independently, we generate a variable U with uniform distribution 
on [0,1] 


e ifkg(H)U < f(A) set Y = H 
e if kg(H)U > f(H) return to (i) and repeat, independently of 
what happened before. 


It can now be proved that f is a density function for Y. Indeed, the point P 
with coordinates P, = H,P2 = kg(H)U is uniformly distributed under the 
graph of kg. By the following lemma, if we restrict P to the cases when 
it lies below the graph of f, its distribution becomes uniform below this 
graph and therefore the distribution of P; has density function f. 


i dent random vari- 
Lemma B.1.1 Let (Pn)n>1 be a sequence of indepen adom 
ables with the same distribution y on RÉ. Let A be a Borel set in ai zs 
that (A) > 0, and let T = inf {n > 1:P, € A}. Then Pr has distributio 


(1/w(A))u|, and E(T) = 1/p(A). 


Proof Exercise. 


y 
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B.1.2 Exercise | 
By slightly modifying the rejection method, show that it is Possit} | 
ę to 


simulate a distribution with density function f even if f is known 
by approximations f, and f, such that for any n, fn < f < a a | 
sequence fn — fn tends pointwise to zero. This property allows à es 

i 


field of application for the rejection method. 


B.1.3 Exercise 
A generator of pseudo-random numbers is a device that Provide 
$ 


sequences of numbers imitating Borel sequences. Most generators are bui] 
on congruence methods. Let us assume that a sequence of such generator, 
is given, and that, for every k, the kth generator, when used N k times, Sup. 
plies the numbers {n/N,;n = 0,...,Nx—1} in an arbitrary order. As soon 
as the kth generator has been used N; times, we move to the next gener. 
ator, and we record all the results one after the other to create a sequence. 


Show that if the sequence (Nz)>1 is increasing and satisfies 


lim — #1 = 9 
klo Ni +--+ N} 


then the resulting sequence is uniformly distributed in [0,1]. 


B.1.4 Exercise: Wiener-Siegel Polychotomy Algorithm 

In their interpretation of quantum mechanics, N. Wiener and A. Siegel 
[1953, 1955] consider the reduction process of the wave packet to be a 
random sampling, that is, a simulation. Let (‘P;) be the sequence of eigen- 


functions of the observables and, for X., |a;|* = 1, let ¥ = >; ail; be the 
unit vector that represents the wave function. Show that we can simulate a 


random variable N with distribution >. |a,|?8, as follows: first draw a 
sequence of bivariate normalized Gaussian variables (g&n) and then select 


the index n that achieves the maximum of |a,|/|¢,|. 


B.2 Glivenko-Cantelli Theorem 
Due to its wide usefulness, the following theorem is sometimes called the 
fundamental theorem of statistics. 


Theorem B.2.1 Let (U,),> be a Borel sequence in [0, 1]. We defin? 
the empirical distribution function of (Un) by 
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LÀ 
T == > 

n(x) N litos (Un), N21. 
n=] 


(Recall the notation T(x) = A,([[0, IDI Then 


(i) a.s., the empirical distribution 


id 
Uy = N > du : 
n=l ` 
converges narrowly to Às; 
(ii) 


sup |ty(x) — T(x)| — 0 as. as N — œ, 
xe [0,1]5 


Proof (i) For any countable dense set D in #{[0, 1]°, R), 


un(e) > As(g) as. forany ge D, 


by the strong law of large numbers and this is enough to ensure narrow 
convergence. 
(ii) Immediate from (i) by using Theorem A.5.2. C] 


Theorem B.2.2 Let (U,),> be a Borel sequence in [0,1], let F be a 
distribution function on R, and let G be the inverse function of F, that is, 


G(y)= inf {x: F(x) 2 y} O<y<l. 


Set 


N 
l 
Ful) = 5 D eUs 


n=l 
Then 


(i) a.s. 
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lim sup sup |Fn&) — F(x)| = 0 
Nto F x 


(ii) if F is a continuous function, the distribution of 


sup [F(x) — F@)| 


does not depend on F. 


Proof (i) By using B.2.1., for almost every ® € Q, 


N 
1 
F dy uwrw) — F(x) as N T co 


n=1 


uniformly with respect to F and x. Since G(b) < x iff b < F(x). we have 


N 

1 

N > 1(GWy(@sx} > NF x). 
n=1 


(ii) If F is continuous it takes all the values in the interval ]0, 1[, and 
thus 


sup 
xeR 


N 

1 

N > lunos — FX) 
n=] 


N 

l 

N D: lluny — Y 
n=) 


as. E 


= sup 
ye [0,1] 


B.2.3 Exercise 

Let u be a probability measure on R with continuous distribution function 
F, and let Fy be as defined above. Show that the distribution of ||Fv ~ 
F|| iP» for 1 <p < co, is independent of u. 


B.3 Central Limit Theorem for Empirical Distributions 


Let (Un)n>1 be a Borel sequence in [0, 1]°, and consider the random va" 
ables 


Oy 


MONTE CARLO METHOD 
THE a 


1 n 
Fae) = = D linen (Up. 
i=] , 


(a) We know from the central 
x, /n(F n(x) — T(x)) converges in law 
(b) Consider now the stochastic pr 


limit theorem that for any fixed 
to N(O, n(x) — (n(x)}?). 
ocess 


VF) xe [0, 1°. 


By the k-dimensional central limit theorem, the random vector {J/n- 


(F (x1) — 1x1)), . ss VIF (xx) — T(x4))} converges in law to N;(0, E) 
for any X1,...,Xx in [0, 1]°, where 


Z = E(HH') - EH.EH' 


Lox 1) 
up ) 


ltron 1) 


(c) If s = 1, these limit distributions are the finite-dimensional distri- 
butions of a real centered Gaussian process Z, with covariance 


Zi eS s Atesi s, tin [0, 1]. 


The Brownian bridge B, — tB, has exactly this covariance function. Since 
the distribution of a centered Gaussian process is entirely determined by 
its covariance function, we can recognize Z, as a Brownian bridge. This 
makes it easier to understand the following Doob-Donsker form of the 
Kolmogorov-Smirnov theorem. 


Theorem B.3.1 Let (U,);>, be a Borel sequence in [0, 1]° and let (F,) 
be the sequence of associated empirical distribution functions. We set 


D*(n)= sup |F) - 1(x)| 


xe [0,1}8 


Dt(n)= sup (F,(x) — ne) 


xe [0,1]° 


D {n)= sup (1x) — Fn). 


xe [0,1)* 
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(i) Case s = 1. Let (Bx)xeļ0,ı] be a standard one-dimensi ü 


. . : Onal Bros. 
motion. We have the following convergences in law Min 
L 
\/nD=(n) = sup |B, — xB,| 
xe [0,1] 
A 
VnDi(n)— sup (By — xB)) 
xe [0,1] 
A 
/nD;(n) = sup (xB, — By). 
xe [0,1] 
The limit distribution functions are given for À > 0 by 
+00 
$ lp) 
P( sup |B, - xB;| < a) = X Ee” v 
xe [0,1] : 
J=-@ 
P( sup (B, — xB)) < a) aP | sup (xB; — B,) < | 
xe [0,1] xe [0,1] 
=1—¢™ 


(ii) Case s > 1. There exists a Gaussian centered real process (Z,).¢ 191) 
with continuous paths and covariance 


B[Z,Zy] = nœ ay) - nony)  x,ye [0,1]° 


such that the random variables \/nD*,(n), /nDŁ (n), and \/nDz(n) con- 
verge in law to 


sup |Z;|, sup Z;, and sup (—Z,), 
xe [0,1]5 xe [0,1]5 xe [0,1]5 


respectively. 


B.3.2 Comments | 
Let s = 1, and consider a random sample from a continuous distribution 
function F, with empirical distribution function Fy. From part (ii) of T 
orem B.2.2, we may infer that the conclusions of part (i) of Theorem B.3. 
are still valid with the same limit distributions. + 
We give a rough sketch of the proof of Theorem B.3.1 in the cas¢ $ a y 
We have already noted that finite-dimensional distributions of the proc 


La 


| 
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) = VAF n(x) — n(x) > 

On(x) = VAE RE) — n)), x € [0,1 

| a | [O,1]} converge to those of a Brownian 
Consider the compact Metrizable space Q: [R]QN01 (where R 

RU {+°}), equipped with its Borel G-field. It follows Gon the 

Stone-Welerstrass theorem that the set of continuous functions on Q, 


aD ber of coordinates. is ad 
HO, R). Ihe probability distribution p, of (au(x), x e [0 A 


on these functions to the probability distribution p of {B,—xB,,x € [0, 1]} 
and this proves narrow convergence of Pn to p. But the functional on Q 


0(@)= sup |o(x)| 
x€ QN [0,1] 


is not p-Riemann integrable: it is continuous in the topology of uniform 
convergence, but not in that of pointwise convergence. So we have to 
change topology on Q to make ® continuous. In fact, there are now far 
more continuous bounded functions on Q, and the convergence of finite- 
dimensional distributions is no longer sufficient to ensure that p, con- 
verges narrowly to p. We need the sequence (p,) to have the property 
of relative compactness, which is involved by Prokhorov’s lemma. The 
uniqueness of its limit point follows from the convergence of finite-dimen- 
sional distributions, and so p, converges to p. 


B.3.3 To Go Further 

More precise results than those of Theorem B.3.1 can be stated. For s = 1, 
there is actually a strong invariance principle for empirical distribution 
functions that allows strong approximation of the process {./n(F (x) — 
F(x)),x € [0,1]} by a Gaussian process {Z,,x € [0,1]}, provided F 
is continuous. In the particular case F(x) = x, Komlos, Major, and Tus- 
nady [1975] have exhibited a sequence of independent Brownian motions 


(B"),>, such that 
(log n)* 
a/n 


Recently, F. Bonvalot [1991] and N. Castelle [1991] have completed their 
demonstration by computing three constants C,K, and À such that for 


every n > 3 and y > 0, 


1 k n n 
sup |/n(F,,(x) — x)- —= > (By - xB) 
Sup VF, 0 F2 | 
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Da 


m 
P{ sup sup |m(Fm(x) — x) — Ve — XB1) > (y + Clog ñ)log } 


1<m<n OSX<1 i=1 


< K exp(-Ay). 


For s = 2, Tusnady [1977] has obtained a strong invariance principle . 
the same estimate O((log n)*/n); higher dimensions remain as Yet un : 
plored. = 


B.4 Stopping Criteria and Asymptotic Rate 


The stopping criteria in the classical Monte Carlo method have to be Prob. | 
abilistic. Typically, if n is sufficiently large, there is only a smal] proba. | 
bility that the n-step computed result is far away from the requested Value 


A mesure que l’événement observé se répète, cette vraisemblance augmente el 
finirait par se confondre avec la certitude, si le nombre des répétitions deve. 
nait infini. Il y a ici deux sortes d’approximations: l’une d’elles est relative aux | 
limites prises, de part et d’autre, des possibilités qui donnent au passé plus de | 
vraisemblance: l’autre approximation se rapporte à la probabilité que ces possi- 
bilités tombent dans ces limites. La répétition de l’événement composé accroîtde 
plus en plus cette probabilité, les limites restant les mêmes: elle resserre de plus 
en plus l'intervalle de ces limites, la probabilité restant la même: dans l'infini, 
cet intervalle devient nul, et la probabilité se change en certitude.2 [Pierre Simon 
Laplace, 1821] 


Suppose that (X,,),>1 is a sequence of i.i.d. random variables, and S$, = 
Xı +---+Xy. If E(X?) < ©, for all À > 0, 


> | ere): | 
ni 


p(|Ż- ex, 
n 


Let | 


“As the observed event is repeated, this likelihood correspondingly increases, and would 
become certainty if the number of repetitions became infinite. There are two sorts of appro” 
imation here: one, with respect to given limits, above and below, is of the possibilities ie 
give more likelihood to past events; the other approximation concerns the probability nee 
these possibilities fall within these limits. The repetition of the event steadily increases t : 
probability, for fixed limits: it steadily narrows the interval of these limits, for fixed pro 
bility: in the limit, this interval has zero length, and probability becomes certainty. 


a 
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DU) = E(e 1), re R. 


The set {t: $C) < co} is an interval 7 containing 0, and the Cramer trans- 
form of the distribution of X; is defined by 


h(b) = sup{ bt — log o(t)}. 
tel 


It is easy to see that y(t) = log (t) is a convex function on / and if the inte- 
rior Z of J is nonempty, then y is differentiable on I. Moreover, if 0 € J, 
then E|Xı| < ©; setting m = E(X,), we have w’(0) = m (see Dacunha- 
Castelle and Duflo [1983] for details). We can now state Chernov’s the- 
orem, which is the starting point of the modern theory of large deviations. 


Theorem B.4.1 Let us assume there exists tọ > 0 such that E(e’!*1!) < 
œ, Then for b > E(X,), 


P(S, > nb) < eo"), 
and for b < E(X;), 
P(S, < nb) <e ©), 


In both cases, h(b) > 0. 


Proof Suppose b > E(X,); we handle the other case by replacing (X,) 
with (—X,,). We have 


P(S, > nb) < inf {E(e “"#"r),u EI 40, ~f} 
= inf {e "Ou VU) uel 0, ~[}} 
= exp (-n sup {bu — y(u) uel (1]0, se }). 


Set g(u) = bu — wu) for u € 1. Then g is concave, g(0) = 0, and g’(0) = 
b- E(X,) > 0. It follows that 


sup {g(u),u € 140, of} = AG) > 0. LI 
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The asymptotic speed is ruled by the Hartman—Wintner law of the itera 
logarithm: If the random variables (X,) are i.1.d., centered, with en, 
o? < œ, then "i 


eer Sa Sn 


lim —== = -lim ——— = 1 a.s. 
Ov 2n log log n ov 2n log log n 


Let F, be the empirical distribution function of a Borel sequence in [0, I} 
For all x in [0, 1]’ we have | 


lim ,/——“—— (F,(x)- nx) = VONE) as. 
2log log n 


and hence 


_ 1 
lim ,/—— D*(n) > sup vno- 1) == as. 
2 log log n í 2 


The last inequality is actually an equality [Chung, 1949; Kiefer, 1961] as 
we state in the following theorem. 


Theorem B.4.2 Let (U,,) be a Borel sequence in [0, 1}°. Then 


— n 1 
in) Re 
ii 2log log n ool”) 2? 


and the same equalities hold for D} (n) and D} (n) in place of D%(n). 


B.4.3 Exercise , 
Let F, be the empirical distribution functions of a Borel sequence I” 
[0, 1}°. For 1 < p < œ, set 


D5(n) = ||Fn — n|iLrqo, ir as. 


(1) Deduce from Theorem B.4.2 that : 


f 
| 
| 
i 
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—_ n : 
lim Flos oka D, (n) < |V TA — 1X) I Ip a.s. 


(2) Show that 


EL(D3(n))"] = “(= g =) | 


B.4.4 Remarks 
Let (Xn) be real valued i.i.d. random variables with distribution H and 
‘continuous distribution function F and let (F,) be the associated empirical 
distribution functions. The following are true (cf. Exercise B.2.3). 

(1) [Finkelstein 1971] 


Tim ec, neris es 
si 21og logn =” FT x = 


(2) [Mogul’skii 1977] 


1 


lim V2nlog log n\|Fn — Flu = 5 a.s. 


2 


TU 
lim V2n log log n||Fn — F |l- = 5 a.s. 


B.4.5 Exercise 

Suppose that Dž (n) and D}(n) are calculated relative to a sampling from 
the uniform distribution on [0, 1], and that D3,(m) and D3(n) are calculated 
relative to a second (independent) sampling from the same distribution. 
Show that, for every € > 0, a.s. there exists nọ such that, for n 2 no, 


# T 
lo < log log log n—log5 +€ 
and 
D3(n) 4 
tos zee < log log log n+ log = + €. 
D;(n) 


y 


So 


Ths 
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B.5 Rate of Convergence of Stochastic Approximation Algorit, 


Suppose that we are given à random variable f (x) that depen : 
parameter x, and we want to approximate the point x Where the ki, et 
F(x) = E(f(x)) assumes a given value, or where the maximum Of F oc Re | 
Stochastic approximation procedures are especially useful in this Situa Ur 
we use the Robbins-Monro procedure [1951] for the first Problem a 
Kiefer-Wolfowitz procedure [1952] for the second. : the 
In the Robbins—Monro scheme for finding some x that meets F x) < 
we start with an initial guess X;. We make successive observations Y B 
and (X,) is chosen according to the formula n1), 


Xnvi = An + Yn(B — Ym). 


Under appropriate hypotheses, X, converges to x". (See Figure 2.1) 


Theorem B.5.1 Consider the recursive equations 


Xn = Xn + Yn(B — Yn+1) n:l, 


where X, and Y, are R‘-valued random variables, B € Rf, and X 1 belongs 
to any LP (Q, RÊ), 1 <p < œ. Assume that the following three conditions 
hold: 


| 
| 
(Hı) (Yn) is a decreasing sequence of positive numbers chosen so that | 

D Yn = œ and X y; < ©, | 
(H2) There exists b > 0 such that |Y,| < b for all n > 2. | 
(H3) Let Fi = o(X;) and Fn = 0(X1,...,Xn,Yo,...,¥n), forn 22 | 

There exists a Borel function F : R? > R4 satisfying | 


EY mil Fn] = FX) 


and a point x* in R® such that, for all 5 in ]0, 1[, 


inf ih E 
Te P * X f(x) B >> 0. 


Then X, converge to x* a.s. and in LP(Q; Rd), for all p in [1, l: 
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Figure 2.1 Simulation of the Robbins—Monro algorithm 
a 2 
” 5+Kn 


Starting point X, = 2, from n = 1 to 200,000, and for K = 10,5,2,1,0.5,0.1,0.05, 0.01, 
0.005, and 0.001. 


Ani =X ( 5 = 1X ,>10.U n41 }) 


Proof Let 
ED: and  Ty=Xn-x, for naL 
n=1 


(1) Almost sure convergence. We have 


(B.52) EUT pall Z] = (Tal2 +21 BF Xn), TotW ENB- Ve lA] 


60 COMPUTATION OF EXPECTATIONS IN FINITE DIMENg 


Figure 2.1 (Continued) 


It follows from (H,) and (H;) that 


> RELY - BLF] < s(IBI + bY. 
k=1 


For n > 1, let 


p 


f 


JHE MONTE CARLO METHOD 


K = 0.01 


6 
Br re a 


K = 0.005 


s ; 

x m, ` a s we DE "i | 4, h Bae A 
ms sa 

ue es ee ee wee r Ka 3 Ki rt fr ro re ae 


ru, NS en, Seer en 
prep ere MUC AE CL LIT SE mn VO A 
i Bt ota on tue eget 

wo. . 


Figure 2.1 (Continued) 


n-| 


Zn = (Tal? + s(IBI +5) -= X RENY en - BPF. 
k=] 
Then 


EZ na Pa) = Zn + 2Yn(B = F(X), Tn), 
and by (H3) it follows that (Z,) is a nonnegative supermartingale. Thus 


r * a x > 2 
(Zn) converges a.s. as n tends to infinity, and the same is true for |T, 
and Le Pen E 
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een %, | 
(2) L?-convergence. Taking expectation in (B.5.2) and summi % | 
n, we get NE oy 
(B.5.3) sup E(|Tn|7) < E(|71|*) + s(B + b}. 
n21 


We observe that 


(B.5.4) [Ths 4 = irr + 2¥n{B — F(Xn), Tn) + 2Yn(F (Xn) = 


3 à n+l, Ta) 
+ Yn IB = Fil 
and so 
(B.5.5) nul? < ITa]? + X FX») — Yess Te) 
k=l 
+ >) vel¥en — BP. 
k=l 
Let 
neli 
Na = D 20E) - Yes Te), n22 
k=1 | 
and 
N° = sup |N;|. 
k22 


The sequence (N,,.7,; n 2 2) is a martingale and from the Burkholder 
inequality it follows that for 1 < p < œ, 


j 1/2 
(B.5.6) IV" lp < C(p) (>: Ay? |T.|7|F (Xx) — ru r) ’ 


k=] p 


where C(p) is the constant in the Burkholder inequality. Hence, 


(B.5.7) IN" lp < C(p)4bV/‘s || sup |T¢lllp- 
k21 


| 
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Put 

T° = sup |T;|. 

k>1 

From (B.5.5), we get 
(B.5.8) T*? < |T|? +N* + s(|B| + b)? 
and so it follows from (B.5.7) that 
(B.5.9) IT, < IIT: llp + SUBI + 5)? + Cp) 4bVSI|T" IL. 
By Doob’s inequality (i.e., (B.5.6) with p = 2), we have 


INIT SIN *|l3 < 6462s sup (E|T;”), 
| kzi 


and this last expression is finite by (B.5.3). From (B.5.8) and (B.5.9) it 
follows successively that. 


IT“ ll2 <% 
and 
IT*lp <%  forany pe [1,f; 


an application of the Lebesgue dominated convergence theorem yields 
L?-convergence. 
(3) Convergence to x*. It follows from (B.5.2) that 


0< Y MELT o FA = BY ET 11?) + s(1B] + 8°, 
k=) 


and so 


Y WEKT. FO ~ Bo <=. 
k=1 


COMPUTATION OF EXPECTATIONS IN FINITE DIMEN 
SION 
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By (H3) | 


ca 


DAT e, FX) — B) = + 


k=1 


on the set {ð < lim |7;| < 1/8}. Therefore this set is P-negligible. D 


B.5.10 Exercise 
Write a program to implement the following algorithm: 


Agel 


1 1 
Die: =Xn se | n+ n= 1, 
l (0 pate, 4) 


where (U,,) is a Borel sequence in [0,1]. Show that the assumptions of the 
theorem hold with 


| 1 n 
F(x)= (1) vo B=5,. .x*=5. 


B.5.11 Stopping Criteria | 
By running the program in Exercise B.5.10, we see that using a stochastic 
algorithm may be hazardous without stopping criteria. In particular, the 
speed of convergence may be so low that the sequence becomes in prac- 
tice constant, before reaching the theoretical limit. With some additional 


assumptions Dvoretzky [1956] has given the following result on the rate 
of convergence. 


Theorem B.5.12 Let a,o and c be positive constants and assume that | 
(again in R4) 


Xni =Xq + Yn(B — Yni) nèl, 


with the following hypotheses 
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al 
(H) 


2 
ca 


Vn Bk: a 
O? + nc2a2 


n21l 


(H3) Let A, = oX) and .F, = o(X1,...,Xm, Ÿ2,..., Ÿn). For n 2 2, 
there exists a Borel function F : R4 — Rd satisfying 


ELY n+ |] = FX») n2l 


ang a point x" in R such that, for all i= 1,...,d and for all x in 
R°, if x; = x; then F;(x) = B;, and if x; 4 x; then 


ae F(x) — Bi P orro 


k 
Xj — X; ca? 


(H3) 


sup E(|F(Xn) J Y nat 1) = oc 


n21 


Suppose moreover that 
El = Pyse. 
Then 


à oa’ 
— x*|-) < ms 21. 
E(X, -x 1) o2 + (n — 1)c?a? . 


Proof Let T, = X, -x° for n 2 1. Then 


EIT nal = ET nl2) + 2 MB - FX), Ta?) + WEIB — Yr |] 
= E(IT, |?) + 2MELB - FX), Tn)] 
+ PEIB - FAT + ye ECE An) — Yael. 


For any x in Rf 


à 
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DiGi x) + 2 Bi — FCO — 47) + VF) — BY?) 


i=l 
>on ( ~ 2Y%n He FQ) Bi +, (BOB ren ) 
k= 
- - | *\2 fi) B; | 
sS - +7) [ro ) i | 


i=l 


* 
Fora; Fa, 


O° + F(x) — bB; 
per <1 — O BR ye, 


and therefore 
E(\Pnsil?) < EITADA - e? + yo? 
If 


ga 
E(|T,| < — 
(Ta) Oo? + (n — 1)c2a2 | 


a straightforward computation proves that 


oa? 


2 
E(\Tnsi| )s o? + nea?’ 


and this completes the proof. 


B.5.13 Exercise 
Show that, if 0 < X; < 10 and O < k < 10/52, the algorithm 


5+k.n\2 


meets the requirements of Theorem B.5.12 [Hint: Show that -(5+kn) : 
Xn < 10+(5+ kny! for all n > 1, and in fact —0.2 < X, < 10.2 for al 
hæ 1.] 


2 1 
An = Xn + Å | — Lx, >10 Ur ) 


P- 
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C1 Koksma-Hlawka Inequality 


c.1.1 Notation 
For x in [0, 1]°, we write x the symmetrical point of x with respect to the 


center of TR [0, 1J". If f : [0,1] — R, we denote by f the 
function f (x) = fŒ). 


Definition C.1.2 A function f : [0,1] — R is said to have finite 
variation if there exists a bounded measure u on the Borel sets of [0, 1]°, 
with support in [0, 1]°\{0}, such that 


f(x) =fO)+p{{[0,x]]}  forallxin[0,1F. 


This measure is unique and its mass |||| is called the variation of f and 
denoted by V(f). 


C.1.3 Remark 

A function f with finite variation is Riemann integrable. In fact, f is 
bounded and the discontinuity points of f are contained in the countable 
union of hyperplanes {a : {x : dis.t.x; = a;} #0}. 


Theorem C.1.4 (Koksma—Hlawka inequality) Let f have finite vari- 


ation on [0, 1]° and suppose we are given a sequence € = (€,) in [0, 1]. 
Then, for any N > Q, 


re | 
fOdx- + 2 fEn| <VIDÉE,N). 


[0,1]5 


Let ji denote the image of u under the symmetry with respect to the center 
of the hypercube. 


Lemma C.1.5 Let u be a bounded measure on [0, 1]°, with distribu- 
tion function F. We have 


| T(x) dÜ(x) = | F(x) dx. 
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Proof of the Lemma 


frac = Í Í liro ( y) dydH(x) 
= {] I (0,2) ¥) 4x) dy 
= | J rosno due ay 
z Í FG)dy. 
Proof of the Theorem Put F =f — f (0). Then 


N 


N 
à X re fiwa ED FE) - ffa 


n=] 


1 
= + DFÉ) - f F(x)dx 


n=1 


1 N 
— Lis, — T(x) 
(Èo) 
1 N 
a | ( x > le ro) dji(x). 
n=) 


= 


d(x) 


——, 


Therefore 


< DE, M)IIĂII = D36, NII. 


re i 
wv ZE [reo x 


C.1.6 Remark 
The same proof shows that if 


f(x) =f) | h( y)dy 
(10.1) 


for some A in L4(dx), with 1 < q < œ, then for p = g/(q — 1), 


ee aS 
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1 N 
+ DE - [rc dx 


n=l 


< DE, N)|lhllą. 


C.1.7 Exercise 
Let f, 8: [0, 1] — R, and let À be a real number. Show that 


(a) F +g) < V(f) + VC). 
(b) VAS) = AV). 
(c) [lf ll < IFC + VC). 


C.1.8 Exercise . 
Let f: [0, 1]* — R be of class C*. Show that 


0‘f 


KUJ OX, +++ OXs 


1 


C.1.9 Exercise 
Define f: [0,1] — R by 


f(%1,X2,.X3) = (x1 + x + x3) À 1. 


Show that f does not have finite variation. [Hint: Show that the distribution 
d°f/dx;0x20x; is not a measure. ] 


C.2 Lower Bounds of Discrepancy 


Theorem C.2.1 (Roth) Let s > 2 and let § be any sequence in [0, 1]°. 
For every N > 1, 


DE, N) 2 DIEN) > C(s) , 


(log Nys-b/2 
N 


Where C(s) > 0 is an absolute constant depending only on s. 


Corollary C.2.2 Let s> 1. For any sequence € in [0,1] we have 
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( N s/2 
pi(é,N)2C'() 


s N, where C!(s) > 0 is an absolut 


for infinitely many positive integer 


constant depending only on S. 
Proof of the Corollary Let N 2 1, and consider the finite Sequence 


_,N} in [0, 1]5*!, where 


(etr) 


..,¥se1) in [0, 1]° x [0, 1[ such 


£2 iene br 


By the above theorem, there exists y = ()1, 
that 


Ny/2 | 
Gi pe EN. | 


È > 1 (0, y)@n) — HY) 


Now let m be the positive integer for which (m — 1)/N < Ys+ı < m/N and 


put y® = (y1,...,y,). Then 


2 Loyo Én) — may) 
n=] 


— INT y) — mn(y®)| 


m 
2 > Loy (Gn) — NT y) 
n=] 


— [Nn y) - mn y)| 


N 
s ` loy n) — NT y) 


n=Í 


2 C(s + 1)(log N)? — n(OYO|N ys — ml 
> C(s + 1)(log N)*/? — | 


It foll 
such in that, for sufficiently large N, there exists m with 1 < M £ N 


mD_(6,m) > C'(s\(log Ny‘/2 > C'(s)\(log m)°?2. 
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Letting N tend to infinity completes the proof. E 


C.2.3 Remark 
For logarithms to the base 2, the above statements are valid with 


C(s) = CS a 1027 
and 
C'(s) = C@+1)—€. 


C.2.4 Exercise 
For s = 1, show that we may not replace “infinitely many positive integers” 
by “for N sufficiently large” in the statement of Corollary C.2.2. 


C.3 Irrational Translations on the Torus 


We shall say that & = (o,...,0t,) in R° is irrational if 1,0,..., Os are 
linearly independent over the rational numbers. Let œ be irrational and, 
for x in Rf, let 6, = {x + na} for n > 1. If m = (m,...,mq) 4 0, and 
N = 1, we have 


N . 
1S nn) L amine) aD — 
N N e2inkm,o) ` 
n= 


By the Weyl criterion, the sequence & = (Ë,),> is u.d. in [0,1] as in 
Example A.5.4.1. We recall the following definition. 


Definition C.3.1 Let (Q,.4,P) be a probability space. A measur- 
able transformation 6: Q — Q is called measure-preserving if P(@~!(A)) = 
P(A) holds for all A in. 4. A measure-preserving transformation 0 of Q 
is called ergodic if for every set A in. @ with 67 !(A) = A P-a.s., we have 
P(A) = 0 or 1. 


Lemma C.3.2 Let @ be a measure-preserving transformation and let 
U:f ++ f o @ be the associated linear isometric operator on L7(Q,. #,P). 


Then @ is ergodic iff the equation Uf =f, f € L?, has only constant 
Solutions. 


Proof Exercise. 
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We shall identify [0, 1[° with the s-dimensional torus Toi 


< 


endowed with the finest topology for which the canonical Projection T /2: 
T, is continuous. This topology is compatible with the Metric S 


Hs EE) = Sup {min (|x; = l- x; = yil)} 


1=1),...,8 


which is translation invariant. 


Theorem C.3.3 The translation Oa: x > {x + @} in T; is ergo dic i 
© is irrational. 


Proof For m in Z° put Yml) = e #0, The functions (Wm, m e Z’) 
set up an orthonormal basis in L?(Ts, Às). For every f in L?(T,, A), 
If © 8all2 = If lle. 


It follows that 6, is measure-preserving and the associated operator U, 
satisfies 


2im(m,QL) 


UaWm = Wn © Oo = € Wn. 


It follows from the lemma that a necessary condition for 0, to be 
ergodic is that 


emo) 41 forall m0, 


which is equivalent to the irrationality of a. Conversely, let us assume that 
© is irrational, and let f in L?(T;,A,;) be represented by its Fourier series 


J E X Ayn. 
me ZS 
If f is invariant, then 
0= Ua -f= D (MD ANN m. 


me ZZ 


Therefore f (m) = 0 for all m + 0, and f is constant. 


| 
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It follows from the pointwise ergodic theorem that for every f in 
L! (Ts, As): 


1 N 
F Df + na) ur. [foray 
n=] 


for As-almost every x in T;. If in addition f is Riemann integrable, we 
know that convergence holds everywhere in T,. Moreover, the following 
statement is true. 


Proposition C.3.4 Let © be irrational in R5. 


(i) The only probability measure u on T, such that (6,):u = u is the 
Lebesgue measure Às. 


(ii) For all f in @(T;), 


lim sup 


== 0: 
NTœ x 


ix | 
FD eae [rod 


n=1 


Proof (i) Let f be in @ (Ts) and let p be a probability measure on T, 
such that (64)+«u1 = u. Then 


ee ae 
CAE 


It follows from the pointwise convergence of (1/N) >. T © 06.10 
[f(>) dy and the Dominated Convergence Theorem that 


f (y)dy = U(f), 
and this proves the first statement. 


(ii) Set 


N 
l n 
Qn = FD Soo 


n=| 
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f . N Sig, È 
It follows from the uniform continuity of f on the compa 


space T, that, for every € > 0, there exists h > O such that *USdont 
d(x,y) < h = |f(x) -f(y)| <e 


and therefore 


N 
D> IS 28a) -F 0 Oy] <e. 
n=1 


Z| 


lon) — oxy) < 


The uniform convergence of (@y)is now a consequence of pointwise con 
vergence and of Ascoli’s theorem. i 


C.3.5 Remark 

It is worth noting that statements (i) and (ii) are equivalent for any compacy 
metric space E endowed with its Borel O-field €, a probability Measure y 
and a measure-preserving transformation @ [Cornfeld, Fomin, and Sinai, 
1982]. 


C.3.6 Exercise 

There is another way to prove part (ii) of Proposition C.3.4. If f is 
a trigonometric polynomial with no constant term, show that the sum 
5. ı f © Oa remains bounded and use the Stone—Weierstrass theorem to 
complete the proof. 


C.3.7 Remark 
For any sequence € = (E,,),>; of real numbers in [0, 1], set 


N 7 
(É) = sup toe DEN). 


Then [Dupain and Sós, 1980] 


En V2})) = (4log (1 + V2)! 


and this is the minimum value among all the numbers @(({nV@})): 


C.3.8 Examples ' 
(1) [Hecke, 1922] for s = 1, [W.M. Schmidt, 1970]. Let © = (a1,-- see 
be an irrational such that each œ; is an algebraic real number, an 
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E= ({na}). Then, for every € > 0, there exists C(a, €) > O such that 
DE, N) < C(a,€)—— 
co 2 — 3 Ni-€ be 


(2) [A. Baker, 1965]. Let B = (Bi,..., Bs) with B; = ei, where the r; are 
distinct non-zero rational numbers. If § = ({mB}), then for any € > 0 there 
exists a number C(f, €) such that 


y 1 
DL, N SCB Àm 


C.3.9 Points of the Torus with Rational Coordinates 


It is possible to show, using the property of unique factorization in the 
ring Z[i] of Gaussian integers, that the algorithm 


ees 4 
Xintl = 5X1,n Sin 

_ 4 3 
Vint = 5X1n + SYin 

_ 5 12 
X2,n+1 = 73%2,.n — 73Y2,n 


_ 12 5 
Y2,nt1 = 73%2,n + 73Y2,n> 


starting from a point ((x1 0, 1,0), (%2,0, Y2,0)) of the torus T2, defines an u.d. 
sequence. 


More generally, the sequence ¢, = (G7, ..., 6X), where the irreducible 
form 


pj qj 
en 
rj fj 


involves in the decomposition of r; a prime number that does not appear 
in that of r1, -++ ,rj-1 [Bouleau, (11)]. 


C.4 Other Usual Sequences with Low Discrepancy 


C.4.1 Halton Sequence n 
Let p > 2 be an integer. If the p-adic expansion of n in N is 


k 
n=ag+ap+t:':+akp 3 


where a; e {0,...,p— 1} for i=0,...,k, then @p(n) is defined by 
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prime. The following discrepancy estimate holds [Meijer, 1968] 


[ON 
do dı dk | 
n(n) = + D + aE, 
A Halton sequence & = (£,) in [0, 1}° is defined by | 
Sn = (Pr)... PO), n21, | 
| | 
Where p1,...,p, are integers that are greater than 1 and pairwise relatively | 
| 


1 = + pi-1 | 
DENZ : + (2log N) , > | 


This means that 
DŽ (É, N) = O(N-\(og NY’). 


C.4.2 Hammersley Sequence 
The Hammersley sequence of order N in [0, 1F(s 2 2) is defined by ` 


n 
En = ( T Pa Pri) I£<n<N, 


where r1,...,r,-1 are the first s— 1 primes. In this case 


C.4.3 Lapeyre-Pagès Sequences 
Let p 2 2 be an integer. For real number x in [0,1[, the p-adic expansion 


is said to be proper if the set {k:x, 4 p — 1} is infinite. If 
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are the proper p-adic expansions of x and y in [0,1[, let x ®, y be the 
number z in [0, 1] with p-adic expansion 


k=0 
where 
Zo = Xo + Yo mod p 
Zn=Xn tyn mod p if Xn-1 + Yn-1 <P 


=1+x,+y, modp if Xn-1 + Yn-1 = P. 
Now suppose that p1,...,p, are greater than 1 and pairwise relatively 
prime. Let ÿi,::.39.1be numbers in [O, 1[ such that, for i = 1,...,s, y; 
has a finite p;-adic expansion and lies in [1/p;, 1[. For x in [0, 1[, put 
Ty,(x) = yi Dp; x. 
Finally, for © in T,, consider the sequence 6 = (6,) defined by 


(Te (01) ste (oF) 


The following estimate holds: 


| > log p;N 
"(ENS —|1+ ;-)p = 
DENS Il ares 


Note that this estimate does not depend on Q. 


C.4.4 Faure Sequences | l 
Let s > 1 and let p > s be a prime number. Let Ap be the set of p-adic 
rationals in [0,1{ (i.e., those with finite p-adic expansion). Define Cp: Ap > 
À, for | 

p 
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R? 
ao ak Stoy 
X= —+...+ 
p pi*! 
by | 
b | 
Cp) = — t+ + SE, | 
where | 
k it 
ed mod p. 
’ DTH i 
Set 
C!=0 if j>i>0 
i! 
E | a SI 
j=) i 


Lemma C.4.5 For n > 1, the infinite matrix C = (C!) satisfies 
Cen a, 


Proof We proceed by induction. The statement holds for n= 1. Sup- 
pose that it holds for some n> 1. Then 


i 


C Sa = yn OG 


k=j k=j 
i i—j 
_ > | i-k pi pk-i _ hi > | l „i-j-l 
= n C; Gi = C; Ci-;n 
kj 1=0 


= Clint D, 


so it holds for n + 1. 


Thus, it turns out that the map Cp is injective and satisfies 
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(aia. AC =. 
C.4.6 Exercise 
Is the mapping Cp extendable to a Riemann integrable mapping from 


(0, 1] into [0,1]? 


Proposition C.4.7 If s 22 and p 2 3 is a prime such that p 2 s, then 
the sequence 


Ci = (P (1), Ch(Ep(n)), rey Cy '(@p(n))) 


satisfies 


* 4 Es pi i s s—1 
venii) (log N) + O(log N)™’)| . 


We refer to H. Faure, [1982] for a complete proof of this proposition. Here 
we only derive an interesting geometric property of Faure sequences. 


Proposition C.4.8 Letme N and! € N°. For any choice of r1,...,rs 
and uj, ..., Uus in N with rı +---+rs =m and ug < p’*, there is exactly one 
element of the sequence Eipm, Bipm+1, Roe , Elpm+pm-1 in the s-dimensional 
interval J = | [j_,[ux/p™. (ux + 1)/p™[. 


Proof For n2 1, let 


co 


n= > ai(n)p' 


i=0 


be the p-adic expansion of n. Note first that, for i 2 m, each a;(n) is 
uniquely determined by the condition /p" < n < lp” +p”. For k =1,...,8, 
we have 


œo (k) 
_ ok-l — yj (n) 
Gak = Ci (®(n)) = 2. “pitt? 


Where 


i CTATIONS IN FIN 


E ITE 
ON OF EXP DIM, 
COMPUTATI MENS, | 


80 | 
Re i-1 y, mod p. | 
ioe — 1) ai(n) p | 
po- Scie Ta 
2 


Din for 0 <j <7 1 follow fro | 
f y: (n) M the | 
Existence and uniqueness Ora € | 


condition 
uz + 1 
u we ge 
E, < Enk < rk 
pr P 


Therefore the vector (ai(n),0 £1 <m-—l)1s ovin 
system of equations in Z/pZ. 


Se k- Dan) = yj.) modp 
i=jk 

k=1,...,5S 

Ieeh 1 


l > i ith m unknown quantities. Its 
This is a system of X4; rx = m equations with í a a Ro. 
determinant is a kind of Van der Monde determinant and Its value is 


JI (k — hyr 40. 


1<h<k<s 


Accordingly, the problem of finding n such that &„ belongs to / and 


lp” <n<(1+1)p™ 
has exactly one solution. D 


C.4.9 Approximate Sequences | | 
In practice we are often forced to use sequences (y,) to approximate | 


sequences (€,) which have suitable asymptotic behavior, but do not have 
rational components. 


Lemma C.4.10 Let & = (Gr) and n = (Nn) be two sequences in [0: I] 
such that 
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ae lőni — Nnil < h(n). 
1,6 
Then, for every p in [1,-[ and every N > 1, 
. g ee 
DEN) - DNS D aw. 
n=1 


Proof 


1 N 
Fy Lomin) — 70) 


< 


N 

1 

N > liana — T(x) 
n=1 


ý | 
1 
Ra ` | Lro En) — Liron) 


Take now the L?-norm to obtain 
| N 
D&N) < D M,N) + — D h). 
n=1 


It is clear that if h(n) — 0 as n tends to infinity, then the sequences € and 
n are simultaneously u.d. We have already seen in Section C.3 a metric 
on the torus T, for which the canonical projection x € R? — {x} e T, 
is continuous. Now suppose that & = (6,) and n = (Nn) are sequences in 
[0, 1]° such that d(E,, Nn) < h(n) for every n. We write r(n) for the number 
of indices k < n such that the distance d(E,, B) from &; to the edge of the 
cube [0, 1]° is no more than h(k). Then 


PS, r(N) 

* * AAA /p als 

DYE,N) - DM N)| < — > a + 
=l 


If € is u.d. and A(n) — 0 as n tends to infinity, then r(N)/N — 0 as N T oo 
and the sequence n is also u.d. 
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C.4.11 Exercises 
(1) Let pı, ...,ps be the first s prime numbers and, for k = | 
sequences x* be defined by ren, 


Show that the sequence ({7x,},..., {7x5 }) is u.d. in [0, 1} (itis an 
imate sequence of ({n4/p1},...,{n4/ps})). noes 
(2) Let & = (e/1,...,e's), where the r; are distinct rational numbers, B 
result of A. Baker (cf., A.5.4.1)) the sequence {no} = ({ne"},... {et 
is u.d. in [0, 1]°. Show that the sequence ({na}}, isa CAGE D) defined e 


ak =ak_,+rk 
k_ k fk 
FT es 
n nly 


with af = r% = 1, is u.d. in [0, 1}. 
To be useful in practice, the sequences x* and a“ in the previous exer- 
cises must be computed with increasing accuracy (like log, n) by repre- 


senting a real number by an array of digits. 


C.5 Low-Discrepancy Sequences and Simulation 

(a) Suppose that X is a uniformly distributed random variable on [0, IP 
and f is a Riemann-integrable function on [0, 1]*. Instead of computing 
E(f(X)) by the law of large numbers, that is, 


N 
1 
E(f (X)) = lim — U Ss. 
(FO) = lim = 2" n) as 
where (U,) is a Borel sequence in [0,1], we may replace (U,) with à 


uniformly distributed sequence (6,) in [0, 17°. 


C.5.1 Remarks th call 
(1) As Un = (Un1,...,Un,), this amounts to replacing the aes with 


EE ARE à f 
to the random function (i.e., the pseudo-random numbers generato 
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E,j(1 SJ S 8). It would be wrong to take a u.d. sequence in [0, 1] and use 
it instead of the sequence of calls to the random function. 


(2) The advantage of this method is that it goes faster, provided that 
the sequence (n) is suitable and f is not too irre 


<i a gular; furthermore, there 
is a deterministic stopping criterion, the Koksm 


a—Hlawka inequality. 


C.5.2 Exercise 
For the Van der Corput sequence x, = ©2(n), show that 


so this sequence is not pseudo-random. [Hint: express x2, and x,:1 as 
functions of x,.] 

(b) Another typical example of the efficiency of low discrepancy 
sequences is given by simulating a real random variable X by a rejec- 
tion method. Let (U,) be a Borel sequence in [0, 1]°, let A be a Borel set 
in [0,1]* with negligible boundary and positive Lebesgue measure, and 
let T, = inf{n 2 1: U, € A}. Suppose that there is a Riemann integrable 
function f such that the distribution of the random variable X is given by 
f(Ur). We have seen the distribution of Ur is uniform in A and so 


1 


EX = Al | fe dx. 


In the classical Monte Carlo method, the value of EX is evaluated by 


N 


D FU AU 


igs (&n) is u.d. in [0,1], however, we also have (exercise) 
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“Ry, h 


N 
> FEn)1a En) 


n=! 


i l 
> uC 
n=l 


can be replaced by Successiv, 
Call, 


EX = lim 


=l 
NÎ 


that is, the Borel sequence in [0, 1]° 
to a low discrepancy sequence. 


C.5.3 Comparison Between the Monte Carlo Method ang 
Quasi-Monte Carlo Methods | 


Difference of Nature Between the Two Approaches The use of Dsey | 
random numbers for computing expectations is quite a common techni a 
It works well for computations with ordinary accuracy, which is oie 
that is wanted when engineers use probabilistic models. 4 

There are, however, several points that are worth keeping in mind, 

(1) No explicit sequence of digits has been proved to have à random 
character. There are deep theoretical reasons for this [Martin-Lof, 1966. 
Dellacherie, 1978; Knuth, 1981]. i 

(2) We can ask not that the sequence satisfy every statement in set 
theory which is true almost surely, but only that it should satisfy the main 
ones: normal character in the sense of Borel, the y-square law, and these 
properties for images of the sequence under a simple family of functions. 
Even in this case, there are very few sequences for which these proper- 
ties have been proved (Champernowne sequence or improvements of it 
[Rauzy, 1981, Dumont and Thomas, 1986], etc.), and in practice they are 
far from being good pseudo-random sequences. 

(3) In practice, pseudo-random sequences are obtained in the following 
way. Families of generators are examined by statistical tests and the 
sequences that behave badly are eliminated. There are a great number 
of tests [Knuth, 1981; Niederreiter, 1978; Ripley 1987; Fushimi, 1988; 
Marsaglia and Zaman, 1990; Anderson, 1990; Altman, 1988; Marsaglia, 
1985; Ripley, 1990; Flajolet, Kirschenhofer, and Tichy, 1988]. Thus on 
gets sequences which can be considered as convenient. Most of them 
are finite (i.e., periodic) sequences but it is also possible to build infinite 
sequences (cf., Exercise B.1.3). se 

The Monte Carlo method works by using these convenient ee 
for computations in the framework of convergence theorems, which : d. 
essentially just new statistical tests that the sequence has not yet pe w 
If the sequence has been used for a long time, then it is reasona 
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think that if it had anomalies they would already be detected. Nevertheless 
the need for longer and longer finite sequences and the imagination of 
mathematicians yield many new finite or infinite sequences; for instance, 


more and more algorithms (discrete dynamical systems) are known to have 
chaotic behavior. 


There is therefore a bet involved in applying the Monte Carlo 
method, which must in consequence be used carefully. The equidistributed 
sequences used in quasi-Monte Carlo methods are quite different in nature, 
in that the equidistribution property is often proved by a demonstration 
that yields an explicit bound of their discrepancy. In other words, only 
one property (equidistribution) of the sequence is used in the quasi-Monte 
Carlo method, and this property is proved to hold. On the other hand, 
the Monte Carlo method is reliable so long as the mathematical model 
to be computed has no relationship with the dynamics of the sequence 
being used. Unfortunately, no rigorous analysis of this “independence” is 
yet available and it is therefore necessary to check that the results do not 
depend on the family of generators and to check that they are continuous 
with respect to the parameters of the model (when such a continuity holds). 


Si haut que l’on remonte dans l’histoire des sciences, on voit la certitude 


que donnent les mathématiques leur valoir un rang éminent dans l’esprit des 
hommes. [Henri Lebesgue, 1938] 


In mathematics we are lucky to have proofs at all! Other subjects are not so 
fortunate. [Laurence C. Young, 1981] — 


Quantitative Comparison of the Speed of Two Methods This sort of com- 
parison was made in Sarkar and Prasad [1987] and Pagès and Xiao [1991]. 
These works show that, for all currently known families of low-discrep- 
ancy sequences on [0,1]*°, the number of drawings required for these 
sequences to equal (and then surpass) the accuracy obtained by pseudo- 
random sequences increases rather quickly with the dimension s. Thus the 
use of low-discrepancy sequences is in practice limited to low dimensional 
problems. We extract from the study quoted in Pagès and Xiao [1991] the 
following results: 

In Figure 2.2, the expression “k times better than random” does not 
mean that the sequence is compared with a random sequence, which would 
give a random result, but that it is compared with the asymptotic speed of 
random sequences given by the Chung-Kiefer theorem (Theorem B.4.2). 


‘However far we go back through the history of science, we see how the certainty given by 
Mathematics affords it a high rank in the minds of men. 
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Studied sequence: irrational translation on the tours. 
Translation of the square roots of the prime numbers mod 1 


20 
= 15 
2 
wo 
5 13 More than 2 times 
Ẹ better than random 
Q 

11 

10 

9 


7 More than 5 times 


better than random 


5 x 10410° 5 x 105 108 
Number of drawings 


Figure 2.2 


C.6 Complement: Boreloid Sequences 


We have seen in Section C.5 that random sequences converge more slowly 
for integration than low-discrepancy sequences, at least when the dimen- 
sion s is small. Nevertheless, they can be used for very irregular functions 
since, by the strong law of large numbers, 


N 
Yf e L'((0,18,dx)  Ef= lim 2 ` fn) as. 
D n=1 


for every Borel sequence (U,,),>; on [0, 1]°. . A 
Thus it is natural to ask whether there exist random sequences X7 


(X,,(@)), which, like Borel sequences, can be used to integrate irregular 
functions and, as the low discrepancy sequences, have the property of ru 


ning fast. 


| 
| 
| 
| 
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Studied Sequence: classical Halton sequence 


20 More than 2 times 


better than random 


de 15 

2 

e 

© 13 More than 5 times 
£ better than random 


More than 10 times 
better than random 


More than 20 times 
better than random 


More than 50 times 
better than random 1 


5 x 108 105 
Number of drawings 


Figure 2.2 (Continued) 


5 x 104 10° 


We emphasize the fact that the study presented here has a theoretical 
character. (It will become apparent in Chapters 3 and 5 that integrating 


very irregular Borel functions poses practical problems even with good 
pseudo-random sequences.) 


Definition C.6.1 Letu bea probability measure on ([0, 1]°,.Z[0, 1]°). 


(i) A sequence of random variables (X,,) with values in [0,1] is said 
to be almost surely \1-distributed, if the sequence (X,(@)},>1 is u- 
distributed on [0, 1]° with probability one. 

(ii) A sequence of random variables (X,) with values in [0, 1]° is said 


to be Boreloid u-distributed if, for every bounded Borel function f 
from [0, 1]° into R, 


N 
(*) à DICO fra as 
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Studied sequence: Lapeyre-Pages sequence 


20 


More than 2 times better than random 


_ 
O1 


More than 5 times better than random 


-à 
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Dimension 
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More than 10 times better than random 


N 


More than 20 times better than random 


More than 50 times better than random 
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Figure 2.2 (Continued) 


Using the same argument that derives the fundamental theorem of statistics 
(also called Glivenko—Cantelli theorem; cf. Theorem B.2.1) from the law 
of large numbers, we see that if a sequence (X,) is Boreloid u-distributed 
then it is almost surely li-distributed. Indeed, for such a sequence we have 


1k 7 
lim x D) - [rau a.s. 


for every f in a countable dense subset D of @({0, 1]°) and so 
1 À 
€ 4 1 paues à Xn À d Vf E D. 
a.s. lim N È fXn) = |J du 


This easily implies 


ë 15 
© 
É 
; 13 
a 4 More than 10 times 
better than random | 
10 | 
g + More than 5 times | 
better than rando | 
7 | 
More than 20 times | 
5 better than random 
3 
2 More than 50 times better than random — 
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studied sequence: Faure sequence 


More than 2 times 


f better than random 


20 


More than 5 times 
better than random 
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ns 
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Number of drawings 


Figure 2.2 (Continued) 


N 
1 
a.s. N > dx, — narrowly. 
= : 


Proposition C.6.2 Let (X,) be a sequence of random variables with 
values in [0, 1]°. Suppose that for every open set A in [0, 1}° 


N 
l 
i sn 1 Xn — (A) a.s. 
(#*) wd A(Xn) >H 


Then (X,) is Boreloid p1-distributed. 


For the proof we use the following lemma. 


Lemma C.6.3 Property (**) for open sets implies property (*) for all 
Continuous f 


i eevee L 
Proof of Lemma Let f be continuous. We may assume 0 < f < 


g P 
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Then 


(fpa |) =e | 
n n 


increasing open sets. We uniformly approximate f 3 | 
| 


where the Gg are i 


€ by 
K 
k=1 


. N 
where Az = Ge\Gr. Then fx satisfies ((1/N) >. fK(Xn) > [feau 


a.s.); hence 


N 
Hg Da € Jr du +2 


N 
ANT 
img D fou 2 [y du 26 


E 


which gives the lemma. 


Proof of Proposition To prove the proposition, we recall the Lusin 
property for a bounded Borel function f: For every sequence (ex) 
decreasing to zero, there exists a sequence of open sets Gx and a sequence 
of continuous functions fg such that Vk, 


W(Gx) < €k 
f =f, outside Gy 


Fell < Ifl- | 


[Rudin, 1966]. Now | | 


| ee | 
FX t 24 RO 


iM 


— 


j| = | 1 
PAKO 


By the lemma, the following inequalities are valid a.s. 
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lim = È f(Xn) < | fran + If ll u(G,) 
1 N 
im gD Fw) > [au - tutos 
Hence lim (1/N) 2a FXn) = |f du as. b 


En lisant M. Lusin, on sera sans doute étonné q’ 


. , Te apprendre que j’ai entre autre 
chose, invente le procede des cribles, construit le ae 


t j ; premier ensemble non analy- 
tique. Nul du moins ne sera plus étonné que je ne le fus. M. Lusin n’est oe 


faitement heureux que s’il a pu attribuer à quelque autre ses propres découvertes. 
Etrange manie; pardonnable il me semble, car il n’est guère à redouter que M. 
Lusin fasse école sur ce point.4 [Henri Lebesgue, 1930] 


C.64 Remarks 


(1) Let u = dx on [0, 1}. The condition that (**) is satisfied for rect- 
angles with rational vertices is equivalent to (X,) being almost surely dx- 
distributed and is also equivalent to (*) being satisfied for all continuous 
f (Exercise). 

(2) There exist almost surely -distributed sequences that are not 


Boreloid u-distributed: Let & be a p-distributed sequence and put Xn = Gi 
for n > 1. Then (*) fails for 


f = Weer En=x} 


(3) There exist Boreloid p-distributed sequences: Let (Xn) be i.i.d. with 
common law i. 


Proposition C.6.5 Let € = (n) be a uniformly distributed sequence on 
(0, 1]°, and let (V,) be a sequence of i.i.d. random variables with absolutely 
continuous common law v. Then the sequence Xn = {E,+Vn} is Boreloid 
uniformly distributed on the cube. 


Proof We write |A| for the Lebesgue measure of a Borel subset A of 


“While reading M. Lusin, people will no doubt be astonished to learn that I have, e 
other things, invented the sieve method and constructed the first non-analytic set. ane y, 
at least, will be more surprised than 1 was. M. Lusin is only completely happy has i ne 
‘ltribute his own discoveries to someone else. A strange habit; nee seems ; 
Since there is little chance that anyone will follow M. Lusin down that road. 
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Sy 
the cube, and put 


= f Latin + x Pdv). 
RS 


Lemma C.6.6 For every Borel subset A, 


N 
1 
| = (A) = JA]. 
Hi iy Data) [A] 


Proof of Lemma The function 


FOr Í 14(Ly +x)) dv) 
RS 


is continuous, as it is the convolution product of a function in L®(R’) with 
a function in L'(R°). Hence 


1 N 
We) | 80)dy=1Al. o 
n=1 ? 


Now, to complete the proof of the proposition, we use a classical ar- 
gument (which is used to prove a strong law of large numbers in the 
case of orthogonal variables in L°). Let Sy = (1/N) X% (La(Xn)— H(A) 

(a) Sy2 — 0 a.s. Indeed, 


PON > m:|Sy2| > €) < Dae 7 din) - GANT] 


N=m| n=l 


and so limy|Sy2| < € a.s. 
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(b) Sy 20 a.s. Indeed, if M? < N < TAT 


Sv — Sul < AL IN 2M 2 
NM? M 
so, by the lemma, 
ma 
N ms aX) > |A as, 
n= 
and the result follows from Proposition C.6.2 . 4 el 


If the support of the measure v is very narrow, the sequence (X,) ini- 
tially behaves like the sequence (6,). Hence if (&,) has a low discrepancy, 
the sequence (X,) gives a fast integration as long as the required accuracy 
iş not too high. For large n, however, the rate of integration deteriorates 
and the random character prevails. 


C.6.7 | 

Figure 2.3 shows what happens when (6,) is taken to be a Halton sequence 
of dimension 1, 2, 3, 4, or 5 and when (V,) is an i.i.d. sequence with 
uniform law on an interval of width 1, 0.1, 0.01, or 0.001. 

We have computed an approximate discrepancy of the sequence Xn = 
En +V, and compared it with that of a Borel sequence (U n) of the same 
dimension. The irregular graph is the logarithm to the base 2 of the ratio 
of these approximate discrepancies 


i App Disc(Xn) | 
82 App Disc(U n) 


When the width of the support of V, is 1, the sequence X „) is ii.d. By 
Exercise B.4.5. the graph is in a neighborhood of the strip 


T 
l T “par 
— log log log n + log ; < . < log log log n - log 5: 


when the width of the support of Vu 18 progressively reduced from 0.1 to 


P 
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Figure 2.3 


0.001, the approximate discrepancy of (X,) is smaller than that of (Un) 
(the logarithm of the ratio is negative). This becomes more significant as 


the support becomes smaller. 
This phenomenon becomes weaker as the dimension increases, since for 


a fixed number of drawings (here 200,000) the quality of Halton se- 
quences with respect to Borel sequences decreases when the dimension 


increases. 


C.6.8 Remark 
It is natural to ask whether it is possible in Proposition C.6.5 to take the Vn 


to be independent with law uniform on [0, /,] with /, — 0, in such a way 
that (X,) remains Boreloid. We do not know the answer to this question. 
It is easily seen that the /, must satisfy >, /, = +e. 
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Fig. 2.3 (Continued) 


D NUMERICAL COMPUTATION OF CONDITIONAL 
EXPECTATION | 


- D.1 Complements on Conditional Expectation 


D.1. iti xpectation Given a Random Variable 
ee orobebility space. It is often useful ant . 
G-field o(Y) generated by a random variable Y: we will “ie Sih 
place of E(X|o(Y)) for any integrable real aoe varia ee Do 
it the conditional expectation of X given Y. The following property 

easy consequence of the monotone class theorem. 


Proposition D.1.2 Let Y be an R4-valued random en soe 
an integrable real-valued random variable. There exists a Borel fu 


FERI R such that 
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E(XIY)=f(Y) P — a.s. 


If we denote by Py the distribution of Y in Rf, then the function f IS 
unique up to a Py-a.s. equality, and for any B in BR), 


| XdP = Í fF) dP). 
YeB B 


The function f is often denoted by 


f(y) = EXIF = y), 
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but this notation must be used carefully because the sets {Y = y} may be 
P-negligible. | 


) be measurable spaces. A 


Definition D.1.3 Let (Q,.7) and (E, & 
function N:Qx 4 — [0,1] 


Markov kernel from (Q,.7) into (E, &)isa 
that satisfies the following properties: 

(i) for fixed œ e Q, C ~ N(@, C) is a probability on (Æ, & ); 
(ii) for fixed Ce 4,0 ~ N@, C) is .7-measurable. 
We have the following important existence theorem. 


Theorem D.1.4 Let Y and X be as in Proposition D.1.2. There exists 
a Markov kernel N from (RE, ZR“) into (IR,.#CR)) such that 


y 


98 COMPUTATION OF EXPECTATIONS IN FINITE py 


(*) for any bounded Borel function @ on R, 


E[@(X)|Y] = | oconer, dx) Pas. 


(**) for any bounded Borel function h on Rx RI, 


ERX, Y)] = | _ AG y)N(),d3)Py(dy), 
RXR 


Such a kernel is called a regular version of the conditional distribution 
of X given Y. If (X,Y ) has density y with respect to the Lebesgu 


, e Measure 
on Rx Rf, a natural choice for N is 


Í y(x, y) dx 
N(y,A) = <4 


Wx, y) dx 
R 


if fe w(x, y) dx > 0 and N(y, A) = 14(0) otherwise. 


D.1.5 Numerical Computation of Conditional Expectation 
Supppose we want to evaluate the function f in the Proposition above. We 
are interested in the case when we know how to simulate the pair (X, Y) 
but either there is no explicitly computable density y or computing the 
ratio 


Í XW(x, y) dx 
ys 


[ve y)dx 


will take a long time. 

Methods are usually based on L’-approximations, either by approxi- 
mating f with explicitly computable smooth functions or by computing f 
point by point and Storing these values. 


D.2 Polynomial Approximation | 


First we assume that Y is a real-valued random variable. 
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Lemma D.2.1 if Yisa bounded random variable and if the support 
of its distribution is not a finite subset of the real line, then the random 


variables T : | ,Ÿ”,..., make up a linearly independent and total system 
2(Q, o(Y), P). 
in LEE | 


proof (a) From the assumption on the distribution of Y it follows that 
the equality 


a; +a,Y+---+a,Y"=0 


a.s. 
leads to 


ay = a, =---=a,=0, 


and this proves the linear independence. 

(b) Put K = ||¥|l|... It follows from the monotone class theorem that 
every random variable Z in L*(Q,o(Y),P) may be written as Z = f(Y) 
for some f in L?({[-K,+K],-A({[-K, +K D), Py). Continuous functions are 


dense in that space, so for every € > 0 there exists some continuous func- 
tion g such that 


If — gllz2q_x,+Kypyy < €- 


It follows from the Weierstrass theorem that there exists a polynomial P 
which meets 


sup |g(y)—P(y)| <e, 


ye (-K,+K] 
and hence 
|Z - PY)|l Laon,» = II f sa P\ln2q—K+K1,Py) = 26, 
which completes the proof of the lemma. L] 


We shall only state the analogous result for an R‘-valued random vari- 
able Y, First we need a definition. 


Definition D.2.2 A (d — 1)-dimensional algebraic manifold in Rf is 
the set of points y = (y;,..-, Yd) that satisfy the relation 


Y 
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P(y) = 0 


for some polynomial P. 


Lemma D.2.3 Suppose that Y is bounded and the Support of its di 
tribution is not included in any (d — 1)-dimensional algebraic manif Hi 
Then the system of random variables {Y{! --- Y3, ni 20} is linearly a 
pendent and total in L?(Q, o(Y), P). ji 


Suppose that the conditions of the lemma are satisfied. Let H n be th 
C? -dimensional vector space generated by the monomials Y‘'' pu 
where n; > 0 and >. ni = n, and let P,(Ÿ) be the orthogonal Projection 
of X onto H,. As n tends to infinity, P,(Y) converges in L?(Q, O(Y), P) to 


E(X|Y) = f(Y). To compute P,,, we must find the coefficients Anny that 
minimize | 
ny Nd 
dni., na Y | ery A 
Nytetng sn L2 

Let 4 denote the column vector of the monomials Y{' a? ba for some 
order in the index set {(#1,...,n4):n1+--:+n4 <n}. The column vector 
a of the coefficients a,» Satisfies the equation 
(*) ELY A'la = EX]. 


We must calculate the coefficients of the matrices E[Z %'] and 
E[X 7], eventually by accelerated simulation. If we can simulate X and 
Y on a space whose dimension is not too large, then we can use low dis- 
crepancy sequences (Section C.5). The following is an attractive and direct 
method to solve equation (*). 


D.2.4 Conjugate Gradient Algorithm 
We want to solve the system 


(*) Ax=b 


where b is in R? and A is a px p positive-definite symmetric matrix. The 
problem can be considered as a minimization problem for the function 


~~ Reticence — | 
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The conjugate gradient method works by computing a finite sequence 
of vectors X0,%1,---»Xp and directions wo, wj,..., Wp Such that w, is the 
search direction of x,,, from x, and J is minimal at the point x,,, along 
this line; the new search direction w,,1 will be the conjugate direction of 
w, at the point xn+1. More explicitly, by setting 


En = ÅXn =p, 


the algorithm goes as follows: 


(1) the starting vector x, is arbitrary, and wọ = go | 
(2) Xn+1 = Xn + PnWn, Where the scalar p, is chosen so that | 
| 


J (Xn + PnWn) < J (xn + 0w,) for any 6in R, 
(3) Wes = Baii + On+1Wn, Where the scalar 0,,.; is chosen so that 
(Wn+1, AW») =O 


As we shall see, if w, #0, then 


(4) 
re (8n, Wn) 
Pr (Awn, Wn). 
and 
(5) 
eee (2n+1, AW») 
Ot AW Wn) 


We have the following result. 


Theorem D.2.5 The algorithm defined by (1), (2), (3), (4), and (5) 
converges in at most p iterations to the solution x of Ax = b. 
Proof (a) Notice first that if g, = 0 then x, is the solution. 
(b) In step (2), we have 


102 COMPUTATION OF EXPECTATIONS IN FINirp Die 
Oy 


LEA +pwn)=0 
op 


for p = pn, and hence (4) holds. 
(c) In step (3), 


(2n+1 + On+1Wn, AWn) = O, 


and hence (5) holds. 
(d) We will show by induction that, for any n 2 p such that g, 4 0, We 
have for any i < n 


(8n; gi) = 0 
(Zn, Wi) = 0 
(w,, Aw;) = O 
a, 70 

w, #0 

Pn # O 


(e) If n = 0, the first four statements are obvious; we have Wo = go 40 
(otherwise xo is the solution) and 


Po = (So: Wo) 
(Awo, Wo) ` 


(f) Suppose that the statements hold for n. If Ln+1 £ 0, we have 


(8n+1, Wn) = (AXn+1 — b, wn) 
= (AG + Pawn) — b, wn) 
= (En, Wn) + Pn(AWn, Wn) 
=0 by(4) 


e fori<n, 
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(Ln+1» Wi) = (2n+1 — 8n, Wj) by the inductive hypothesis 
= Pn(Aw,, wi) 


=0 by the inductive hypothesis 


e fori<n+li, 


(2n+1, Li) = (nti; Wi—OiW;_;) 
=0 


e for i <n, 


(Wn+1, AW;) = (8&n+1, Awi) + On+1(Wn, Awi) 


= (gm a s) 


=0 
(Wn+1,AW,) = 0 by the choice of, 


Gaps _ (gn+1, AWn) 
(Awn, Wn) - 

7 — (8ne1, Sn — £n) 

7 (Sn+1 — Zn Wn) 


Ign |? 
p (8n+1 — 8n, En + On8n—1) 
_ [Bnei 
7 lgnl? 
> 0 


| Wrst |? = (Las + Ons+18ns 8n+1 + Unr1Ln) 
2 2 2 
= lens + (On41) len| 
>0 
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T (Sn+1,Wn+1) 
(AWns1, Wn+1) 

— [nal 

7 (AWn+1, Wn+1) 

< 0. 


Pn+1 = 


(g) The p + 1 directions g,, for 0 <n < p, are orthogonal. Therefore 
8n = 0 for some n < p. This completes the proof of the theorem. O 


D.2.6 Remarks 


(1) If p is large,we stop the process as soon as the ratio lgnsil°/l gol is 
small enough. | 


(2) We are well advised to compute 0, with double precision. 


D.2.7 Particular Cases 


We can often use classical orthogonal polynomials, which avoids compu- 
tation. 


For instance, consider the case of Legendre polynomials. The Legendre 
polynomials are defined by formulas 


Leo(y) = 1 


N2 1 d” 
Len(y) = (-1)" — OUR Gas 


The collection 
Ley (x) = Len (x) --: Le, (xa), Reis). =) 


is an orthonormal basis in L7((0, 1]4, Au). Suppose that X and Y can be 
simulated by | 


X=f\(U,W) 
Y = f(V, W) 


and (U,V, W) is a u.d. random vector in [0, 1]°! x [0, 1]°2 x [0, 11°. Sup- 
pose also that the conditional distribution N of V given Y is easily calcu- 
lated. Here the desired function f(y) = E(X|Y = y) takes the value 


AE RE 
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FO) = [Noan 


here A(v) = EIX |V = v] (exercise: prove it). So we must compute the 
HUE h, which can be done by computing (using simulation or accel- 


erated simulation) the scalar products 


ax = E[XLe,V)]. 


In L?((0, 11°, Às), we use the expansion 


co 


|n|<k 


h(v) = i > ajLe xv) 
and the final step is to compute the integrals 


fy (y, dv)Le;(v). 
The Chebyshev polynomials 


Toy) = 1 
Py) = V/2cos nO for y=cos@ 


are an orthonormal basis in L?([-1, +1], dy/(a/1 — y?)). They are espe- 
cially useful for problems involving uniformly distributed random vari- 
ables on the unit circle. : 


The Laguerre polynomials 
e d" 


Ln(y) = n! dy’ 


Gen n>0 


are an orthonormal basis in L2([0, œ|, edy). They are convenient when 
exponentially distributed random variables are involved. 
The Hermite polynomials 


A ds A". 2 
Pal y) = aes a AT) n>0 


vn! dy" 
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are an orthonormal basis in L*(R,N(0,1)). They are useful when y : 
Gaussian variable, but X or the pair (X,Y) are not. “a 
D.3 Point by Point Calculation 

In order to deal with the rejection method, we assume that the pair (x y 
can be simulated on a Polish probability space (W, 7/;u) in Which - 2 
know a sequence ¢ = (E,,) such that Di 


ix 
N 28 


converges narrowly to u. For example, W may be [0, 1]? or O LN Fe 
the sake of simplicity we assume that Y is a [0, 1]-valued random Variable 


(it is easy to extend the results to [0, 1]° or R5). 
In practice the real number Y(o) is represented by some binary digits 


and we compute the function f(y) = E[X|Y = y] at the points of this 
dyadic network. Let 


be the canonical dyadic expansion of x € [0, I[; that is, e; = 0 or 1 and 
infinitely many of the e; are zero. For k > 1, consider the truncation map 


T4 given by 
ie Le Lx] 


APF AES zi = DK 


i=] 


We can now state the following approximation result. 


Lemma D.3.1 Let X be an integrable real random variable, and put 


Tr = TY). Then, 


E[X|Y] = lim EUX YA] 


a.s. and in L'-norm. For p > 1, if X belongs to L?, then convergence holds 


also in L?-norm. 
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proof Let :74 be the partition {[i/2*, (i+ 1)/2*[,i=0,...,2*-1)} and 
jet J; be the related algebra. It follows from the equality 
€ t 


V A=.F(0, 1D 
k 


that 


VV o(Y;) = o(Y). 
k 


The proof is completed by applying the martingale convergence 
theorem. 


Let us restate the conclusion in different terms: if 
EY = fx Yo), 
then 


fY) = lim fx) = lim feoTi(¥) a.s. 


So we must compute the functions fg. We use the following method. 


(i) Take a point ¢, from the sequence 6, and compute Y(E,) and X(E,). 
(ii) For k € N, and i = 0,...,2* — 1, compute the ratio m; given by 
the sum of the values of X(§,) such that Y(&,) € [i2-*, (i+ 1)2-| 
divided by the number of such Y(6,). 
(iii) Set 


fd") = mi. 


In fact, it is clear that 


ok 4 


Ms à BIX liye kcistya Ay 
dsfs 2 PY € [i244 DA Liz Léo tie), 
i0) | z i 


and we have 
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E[X1.. 
5 [x llik isna- (Y )] | DA ED 2k (Y 
PY = ae, 
E {12-*, G+ 1)2-*1) ~ lim N | 


à lt ial (Ce) 


n=] 
oe the functions X and L 
Particular, this last conditi 
> thi ition hold ; 

are U-Riemana eee S under the hypothesis that Kak 
the probability law of Y. Ùs negligible ty 
e fi . . i 

e aoe algorithm works when the pair (X , Y) is si 
re 7 i ] 3 nes (En) is given by a regular network of a in 
> 11°. Lhe function f} is stored in a file with 24 entries . pi 
and the 


accuracy to which f is d i 
on f is defined is controlled by the number &. (See Fig. 


i2-k (is1y9-k O Ÿ are U-Riemann inte 


Program EspXgivenY: {computation of E[X|Y = y] = f(y) when the pai 
X, Y) is si air 
EEE ( ) is simulated on [0, 1]4} 
kk=1024; mm=1024; {kk = 2k and mm = 2m} 
Var a: array [0..kk] of real; 
b: array [0,,kk] of integer; {the vector a will contain the values of f and the 


vector b will contain the number of values which 
B are averaged so that the precision can be given} 
procedure ConditionalExp; 
type vector = array [1..d] of real; 
var  v:vector; 
r: integer; 


procedure step {u:vector}; 
var i:integer;x,y:real; 
begin 
% computation of 
y:=Y(u[1], ..., u[d]) % 
% computation of 
x:=X(u[1], ..., u[d]) % 


i:=trunc(kk*y); {what gives the index i such that y be in the 


interval [i/2K, (i+1)/2*[} 


b[iJ:=b[iJ+1; . . 
i iJ)/b[i i : f the X(u)’s for the 
= a[i]+(x-afi)/bli]; {updating of the average of the | 

DE u’s such that Y(u) be in [i/2, (i+1)/2*[} 
procedure unwind(q:integer); {recursive procedure } 


var j:integer; 
Figure 2.4 


Î 


+=. rE i A 
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42 Remarks 
(1) The function f is not calculated at points y that are not visited by Y 
and it is inaccurate at points that are seldom visited; it is then interesting 
to store, in addition to the file which gives f}, a file giving the number of 


computed values ci]. This provides some information about the accuracy 
obtained. 


(2) Would it be more profitable to take for (E, 
sequence in [0, 1]“? It depends very much on the d 

(3) It is important to note an essential differe 
methods, which often work much faster: the metho 
itivity. The conditional expectation of a positive ra 
tive, and so is the numerical approximation obtained by the point-by-point 
method. This may be imperative in some applications, for instance when 
X is a physical quantity like dissipated energy, mass, or inertia. 


:) a low discrepancy 
esired accuracy. 

nce from polynomial 
d above keeps up pos- 
ndom variable is posi- 


a rr Se 
begin {let u run through the discretization of [0,1]4 
if q=d+1 then step(v) and does step(u) of each time} 
else for j:=1 to mm do 
begin 
v(q]:=j/mm; 
unwind(q+1) 
end 
end; 
begin 
for r:=0 to kk do 
begin 
a(r]:=0; b[r]:=0; 
end; 
unwind(1) 
end; 


function f(y:real): real: 


- Var i: integer; 


begin 
i:=trunc(kk*y); 
f:=a[i] 

end; 


begin {main program} 
ConditionalExp; 


% use of the function f% 
end. 


Figure 2.4 (Continued) 
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D.4 Comparison of Both Methods on an Example 


D.4.1 Principle of Dynamical Weighing 
Suppose we want to take measurements from vehicles without stop.: 
them in order to assess whether or not the rules about maximum Ta 
loads per axle-tree are being kept. A device (piezoelectric cable) es 
on the road and, following calibration, evaluates with suitable accum. 
the real force undergone by the ground when an axle-tree runs Over, Th 
movement of the vehicle (which depend on road irregularities, brakin | 
acceleration, and so on) may lead to significant differences between i 
load at rest per axle-tree and the measured quantity. 

Therefore we want to estimate as closely as possible the load at rest per 
axle-tree from the measurements recorded by one or several piezoelectric 


bars. 


D.4.2 Conditional Expectations to Be Computed 
A vehicle can be represented by 


e the vector of static loads per axle-tree 


M = (mı, m™,..., Mr), r = the number of axle-trees 


e the vector of parameters 
A= se), 


which specify the geometrical and mechanical features. 


In the case when three bars are embedded in the ground, the weight meas- 
urements are displayed by three vectors 


Pi = (Piera Die) 
P, = (P21: ..., Por) 
P3 = (P31,..., Par) 


Some features are easily calculated, such as the velocity (since there iS 
more than one bar) and therefore the distance between axle-trees. Other 
quantities cannot be explicitly deduced from the measurements, SO we 
must choose a reasonable distribution for them. Thus we give a probability 
distribution to the pair (M, A) and hence endow the vectors (M, Pi, P 2, P 3) 
with a joint distribution. The problem is then to compute the conditiona 


T 


pP- 


i 
f 


| 
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expectation 
E[M|P, = pi, P2 = po, P3 = P3) =f (Pi, P2, P3), 
and possibly the conditional variance 
E[|M - f (Pi, P2,P3)\"|P, = pi, P3 = p>, P3 = Da]. 


Models are, for the most part, caricatures of reality, but if they are good, then, 


like good caricatures, they portray, though perhaps in distorted manner, some 
features of the real world. [Mark Kac, 1969] 


p.4.3 Application of the Polynomial Method 


We write X and Y in place of M and (P, Po, P3). We may assume that X 
is real-valued, since the conditional expectation is computed component 
by component. We make the assumptions of lemma D.2.3. 

(a) Computer representation of the monomials. The C?,, components 
of Y% must be organized in a linear order by some convenient proce- 
dure. For this purpose the vector of C,,, multiindices is built up by a 
twofold recurrence in n and d. This recurrence takes the form of an array 
of pointers, which is built and run through very quickly. 

(b) Matrix and covariance vector calculation. The matrix E[Z 7711 
and the vector E[X Y ] are evaluated by the classical Monte Carlo method: 
after each drawing of Y and X a new copy of the matrix 7 7! and the 
vector XY is computed and the mean is updated. 

(c) Implementation of the conjugate gradient-algorithm. The matrix 
ELZ %'] is positive semi-definite and so are its Monte Carlo approxi- 
mations. However, the approximation may not be a positive definite 
matrix. The conjugate gradient algorithm provides a minimum value for 
the quadratic form, but this minimum is not, unique and depends on the 
arbitrary initial value. In practice, this (nonunique) value is nevertheless 
acceptable as an approximation of the conditional expectation. 

(d) Output. Once we have run the program, we can compute the con- 
ditional expectation of the weight per axle-tree given the measurements. 
his is achieved quickly and accurately because the conditional expecta- 
tion has a polynomial expression with respect to the recorded data. 


a Comparison with the Point-by-Point Method 
e two methods give the same results, but the point-by-point method, 
While easier to use, has some intrinsic drawbacks: 
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it ee 


eft-half of the square) 
gh the same number 
e right-hand side 


y obtained by pseudo-random numbers (1 
ence (right-half of the square). Althou 
he grey appears darker on th 
ften on the left-hand side. 


Figure 2.5 Gradation of gre 
and by a low-discrepancy sequ 


of points are thrown into the two halves, t 
because the small black spots overlap more o 


s a strong effect on the result; 


e the way we choose our discrete set ha 
he esti- 


e if the random variable Y seldom takes values near y, thent 
mation of E[X|Y = y] 1s very bad, and outliers are observed. 


f the discussion in Section C.5: 
re regularly distrib 


Figure 2.5 is an illustration O in low 
uted 


dimensions, low-discrepancy sequences are mo 
than pseudo random sequences. 
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CHAPTER 3 


Simulation of Random Processes 


In this chapter we deal with simulation methods for computing quantities 
related to the main classes of random processes: stationary, Markov, and 
point processes. Infinite-dimensional representations are quite usual here 
and therefore we must proceed with care and make sure that our definitions 
are rigorous. | 


A INTEGRATION IN LARGE OR INFINITE DIMENSIONS 


In this section we formalize the main notions we shall need in infinite 
dimensions and introduce the Bernoulli shift method. 


A.1 Changing Dimension 


Let (Q1, Æ 1, P1) and (Qy,.42, P2) be Polish spaces endowed with their 
Borel o-fields and probability measures P:, P2. Let ọ be a measurable 
map from Q,-into Q, such that | 


(i) P,{x € Q : g is not continuous at x} = 0; 
(ii) +P, = Pa, 


If f is a P,-Riemann integrable function, then f o @ is P,-Riemann inte- 
grable and if Ẹ = (Ë,) is P.-distributed, we have 


N 
` 5s o @(En) > f opdlPi= fr dP). 
n= 


It follows that @(E,,) is P2-distributed, 
113 
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A.1.1 Example i | 
| | 


For x € JO, 1] let 
r= NE, an € {0,1} 


be the dyadic expansion of x such that D}, 4n = +c. Define the man.. 
Ps :J0, 1] — [0,17 by PPing 
Psx) = (Y1; <- -> Ys), 


where 


— Asn+p-1 _ 
ED o VAS PR 


n=] 
Then @, satisfies (i) and (ii) with P; = A; and P; = As. 


A.1.2 Remark 
It would be interesting to use mappings 6 that transform low-discrepancy 


sequences in [0, 1] into low discrepancy sequences in [0,1]° or [0,1]. 
However, such good mappings are not known, and the dimension of the 
canonical space has an important role in simulation. It does matter whether 


a random variable is defined on [0, 1] or on [0, 1]”. 


A.2 Simulation of a Non-Riemann Integrable Random Variable 


_ Let X be a non-Riemann integrable random variable defined on the space 
([0, 11,.27([0, 11), À). It is not possible to use the variable X directly in 
order to approximate the law of X by simulation. If (€,) is u.d. then the 
average N`! X^, X(E,) does not in general converge to E(X). 


A.2.1 Example 
Let x €]0,1] be given by 


Put 


EO 

pation IN LARG R INFINITE DIMENSIONS 
É 
nt 

1 n 

A(x) = sup — 
az. 
n>1 MN Pa 


ak is not Riemann integrable. Indeed, let 
Th | 


k=1 


E LL 1 
A= freins Ya Lames ob 


we have Ma (A) = 2 ane X(x) < 1 for any x € A, whereas in every neigh- 
porhood of x there exists y that satisfies X(y) = 1. 8 


2.2 Example. 
Let (Xn) be the sequence of coordinate mappings of (1-1, +1], 2 ([-1, 
Apa . Put 


Tue 
| (su Oo Sn 1 0 
Poea l À 


It can be proved that X is a bounded non-Riemann integrable variable. 


Definition A.2.3 A non Riemann integrable random variable X is said 
to be simulated if it is possible for some s € N° U {+œ} to exhibit a 
Riemann integrable random variable Y defined on ({0, 11°, 32 (0, 115), As) 


with the same law as X. 


It iş easier to find such a variable than to obtain an analytic expression of 
the distribution of X (here analytic simply means: involving only standard 


functions). 


A.2.4 Example: Polya Urn Scheme | 
Suppose that we have an urn that initially contains W white and b black 


balls. At each draw we remove a ball at random and then replace it, 

together with another ball of the same color. Let W, and Bn ue a 
number of white and black balls, respectively, in the um mae Ps 
drawing, and write Xn = W,,/(Wat B,), n20, for the De oer 
balls. Note also that Wn + Bn = W +P +n- ‘ arate (0, 1] and 
experiment is given by introducing à Borel sequence (U n)n>1 . 


Writing 
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Xo = w/(w +b) 
1 


Xni = (wt bt tn + Ly 
n+l PERNE )Xn { +1 xn}) 


n>0. 

The sequence (x,) turns out to be a Markov chain. It is also Clear tha 

is a bounded martingale with respect to the past history O-fields Go) 
Oo(Uz; 1 < k < n) = o(x;1 < k < n), and that the martingale Convey “RS 
theorem therefore is applicable. The limit random variable Senge 


Xoo = lim SUP Xp, 
n 


a mapping from [0, 1]" equipped with the product topology into (0, 1) ; 
not Riemann integrable with respect to the product measure d oN. but 
w= b= 1, its distribution is uniform in [0,1], and so x. is simulated. 


A.2.5 Exercise 


(1) Show that in the preceding example, the law of x 
beta-law with density 


is in general a 
+b- 1)! 
(1 agi bar we Dh 
(w—1)!(b- 1)! 
(2) Show that if the urn has initially three balls of different colors, the 
barycentric representation of the ratio of balls of each color converges 


almost surely to a random point in the equilateral triangle with unit sides, 
which is distributed in proportion to the area measure. 


A.3 Effective Computation of the Expectation of Random 
Variables in Z! 


Unbounded random variables are very commonly used in simulation. For 
instance, the log function on ]0, 1] is used to simulate the exponential law 
by 


l 
Xz -7 log U. 


But unbounded random variables cannot be used if the position of poles 
is unknown. Even when the position is known, the result will depend on 
the speed with which the uniformly distributed sequence nears the poles. 
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m 

; se ft? ax = 2 by simulati 
ute Jo l 


Sequence & ~ 
null set A c N* (Le., such 


im sup — ag) 
í Me n 


s, it is often possible to compute the expectation of unbounded | 
Nevertheles Riemanni integrable functions without proceeding to a reduc- 
non- | 
| of oe Riemann integrable case. 
| tion tO 


A3.2 Example 


)nz1 be the Borel sequence obtained by considering the coordinate 
U nzi a 
sere of ([0, 1}°,2..). Put 


Aye 


‘ and Y = lim sup Mal: 
l 


CO 


1 
IY — alik <> a5. 


i=n+1 


ence, 1n order wi isi € > 0, it 1S sufficient 
Y) ith precision 

a n large enough that > ji=n+1 ; l ) < / 

to choose | 3 | € 2 and then to compute 

E(IX,|) by simulation with accuracy € 2 


| 0, 1] ? ) ( P:; i 


tisfies 
j ero that sa 
ence (an) decreasing to Z 

(i) There is a sequ 


> 1. 
| \¥|dP<a, Yn 
|Y |>n 
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Such a sequence is called an effective bound, D: 
(ii) For every k > 1, there is a sequence ( Ínkn Sija T 

(resp., cylindrical polynomials) with rational coeffici ents nom; 

the convergence to zero of SUch 4, 


IF, (A k) V (—k)) Ir 


is proved with an effective bound. 


A.4 Integration of Functionals of Stochastic Processes 


It is often necessary to deal with random variables that are ee 
defined in ([0, 1]*, A..). For instance, let (X,) be a Markov chain i Pi 

Neq 
by 


Xn+1 = F(X n n, Ons), Xo =X, 


where F: R? x N x [0,1] — Ris given and (U,) is à Bore 
sequence in [0, 1]*. Actually (X,,) can be defined on the Probability Space | 
([0, 1], A4)8N", where the U, are the coordinate Mappings, | 
We want to find good double sequences | 

` | 

| 

| 

| 

| 

| 


ity = (ns un, .….)€ ([0, 1 


| 
> | s | 
Uy = gupa € ([0, LN | 


in such a way that, for H : (RÒN -; R, the average 


N 


1 
x D AXi) 


J=1 
converges rapidly to [H(X.)]. 


A.4.1 Exercise 


Suppose that (6,) is u.d. in [0, 1], that (V;) is a Borel sequence in [0, 1} 
and let 4y= [Vr +k] for i,j e N°, Show that the sequence 
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à not almost surely À distributed. 


A4.2 Exercise 

: . k ge ei 

suppose’ = = [O, 1] for - in N and let ia en" Sh n 
sequence (Mj = Waij» - -Mijs - - -))jz1 18 p-distributed i E 
the sequence uted iff, for every q e N*, 


Ta 
(uj = Uys... , qj))jz1 


is Ae -distributed. 


A43 Remark 
The double sequences that are at present known to yield A..-distributed 
sequences, for instance, those that are obtained from Halton sequences by 


putting 


Uij = Pp; (J) pi = ith prime number, 


are inefficient in practice. 


A.4.4 Exercise 
Let X, be a Markov chain with values in E = R* and suppose that (Xn) 


is the sequence of coordinate mappings in the canonical space EN, Let C 
be a compact subset of Æ with boundary dC = C\C and entrance time 


T = inf{n>0:X, € C}. Suppose that 


(i) P(T < >) = 1; 
(ii) P(Xr € dC) = 0. 
int: If X7(@) € C, we have T(@) = 


Show that Xr is Riemann integrable. [H 
d Xr is continuous at point @.] 


T(w’) for any œ close enough to œ, an 


A.4.5 Remark 

The notion of a P-Riemann integrable function depends on the topology. 
If several natural topologies give rise to the same Borel Leonie such as 
in the Wiener space setting, the associated classes of | anan ov 
grable functions may be more or less restricted, or even incomparable (cf. 


Chapter 5). 
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a N if and only if for every k the sen ED 

, 0, 1]. For every k the discrepane 
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A.4.6 Discre À 
By Exercise À.4.4; @ 
ue where P = on [ 
+s first k coordinates 18 U-®- ~ -define 
its Nr (E, N) is well-de 


projected sequence D 


* (k) 
( is P — distributed) © (Vk € N DAEN oii 0). 


iti that > 


= (a DPE, N) = o) | 
N—c 


k=1 


This is valid when D® is any of the discrepancies DZ or D*, for T 
[1,œ[. This weighted criterion is, however, not symmetrical with Tespect 
to coordinates. We can illustrate this more explicitly as follows. 

Let .F be a class of functions from [0, 1] to [0, 1] and let (ee) 
be a sequence of points in [0, 1]. Consider the criterion nae 


1 N-1 
[rar- = &] 
0 


A= 


DE, N) = sup 


JEF 


The following result is due to Sobol’ [1974]. 


Pr oye 0 | 
7 ee eae Let Xii € N, be the coordinate mappings of 
Fe ja n 7, contains the functions fj; = (X; — X j), Vi,j € N 
s no io 3 
vi exist a sequence E such that DE, N) tends to zero as 


uy eh an N such a DA res of points in [0, 1]§. Suppose that 
t M be an int ee et aE LA 
intervals J, of ut yay esas the partition of [O, 1] into 2” dyadic 

ki = 2%, Forxe [0, 1], let k(x) be the index of 


the interval /, s 
k Such th: 
hat x € Jy. Each column in the following matrix 


GRATION JN LARGE OR INFINITE DIMENSIONS a 
nT , 
k(Xo(Eo)) EEE o  E(X: 
ko) EE) ce. 


k(Xo(En-1)) KAn- ... KAEN) 


an arta with N components taking integer values between 1 and 2M 
j are are thus two identical columns in the matrix, say, columns i and j 


Th 
Then 


1X i(Sn) — Xj(En)| = 2 Yn = 0,...,N — 1. 


Hence 


1 N-1 
DRE -X EN ST. 


n=0 
Thus the function fij = (Xi — X;)° satisfies 
jia i 
fruar = yD SitGn) > > aa 


n=0 


Since fi; € F Vi,j and M is arbitrary, we get 


sup 
fer 


which contradicts the choice of N. Therefore D“E&,N) is bounded away 
from zero. L] 


AS The Shift Method 
The infinite- or high-dimensional case turns out to have its own features 
that allow us to avoid having to take an average on completely different 


Sample paths. 


AS.1 An Efficient Method 

Although the pointwise ergodic theorem has 
si (such as the iterated logarithm law), We 
ems at present to be the most efficient for computing 


no typical speed of conver- 
give here the method that 
high-dimensional 
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rocesses: the implementation of this ES | 
M | 


functionals of stochastic PI | 
f the notion of pointer. 


makes great use O ) | 
Let us consider the preceding Markov chain 


Xn = F (Xn 1, Un+1); 


| 
Xo =x, | 
| 


] sequence in [0, 1] given by the coordinate 
Map, 


where (Un) is the Bore 
i f ({0, 1]°, À æ). 
pings of ([0, 1] time T = inf(n>1:X, © 4 


Consider, for instance, the hitting 
the functional G(XT; T). The expectation E[G(X7,T)] can be comm 
ed 


by Birkhoff’s theorem 
N-1 
1 
Pisin G(Xr,T) o 0” a.s. 
E[GXr,T)] Hoy 2 


the shift operator satisfying 


C9) 


whenever G(Xr, T) € L', where @ is 


Vo e [0,1]” U, o 0(@) = U»+1(@), Vn>1. 


The use of pointers can be explained as follows. In order to apply (+) 
we have to compute G(X7,T) at successive points ©, 6(@), 07(@),... in 
[0, 1]°. These points are sequences of points in [0, 1]: 


œ = (U;(@), U2(@), .. .. Ux(@), - - -) 
6(@) = (U2(@), U3(), sey U r+ (0), se .) 


i as we suppose here, the stopping time T is finite, we need only compute 
a finite length of each of these sequences: the sequence œ is computed until 


T(@), which is given by the test 


X;(@) still outside A = k < T(@) 
(test) 
X,(@) € A for the first time = k = T(o). 


ao we have picked out the sequence œ = (U1, U2,.--); and WE 
A oe fe He UU 2,-.. in pointers indicated in Figure 3.1. The 

q x (œ) = (U2, U3,...) is already partially chosen: either it is long 
enough or it must be lengthened, in which case we have the new scheme 


(Figure 3.2). 


E 
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Xo XN ~ Xx, 


Figure 3.1 
X > XN Xo AUS Xp TPS Xe 


Figure 3.2 


In these figures the arrows ~> represent computations to perform, 
specifically, to compute 


X n+1 = FX nl; Uni). 


When we compute F(x,n, y), however, only the variable x EN es 
new and partial computations depending only on y can be store ene 
box of U,, in such a way that the remaining ek ale i sialon i 
storage must of course be made while we are lengthening the seq 
explained above. 
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A.5.2 Remarks : Sa 
(1) We shall see later in this chapter and in Chapter 5 that | 
in general many different ways to simulate a stochastic Proce th | 
on different representations on the space ([0,1], .Z7([0, 1}), À yen’ 
case of Brownian motion, this is considered in Section F of Ch N the | 
The efficiency of the shift method may depend strongly on the é $ | | 
representation (cf. Chapter 5, Section B.4). Oj 
(2) The results for Markov chains are of course valid for à lar 
of stochastic processes or stochastic fields, which, in most cases © Clase 
defined in a natural way on ([0, 1],.A([0, 1]), à1)®™ after discretizar, e 
necessary. 10n if 
(3) It may happen that the process whose functionals are studieg | 
itself mixing or simply ergodic. In this case, the shift methog cn : 
applied to the process itself. This works as soon as a sample path of a 
trary length can be simulated. j 
(4) The method can be slightly improved by using the tWo-sideq 
Bernoulli shift. The idea is to simulate two independent copies of the Pro- 
cess, one with the classical right to left shift, the other with a left to right 
shift beginning with the box left by the first one. The benefit is negligible 


though, in high dimensions. 


A.5.3 Connection with Normal Numbers 

For the sake of simplicity, let us replace [0, 1] by {0, 1}§. The best points 
for integration by the shift are fast normal numbers. Recall that a real 
number x in [0,1], or a sequence of binary digits, is called normal if 
the sequence 0”(x) is -distributed on {0,1} equipped with the measure 
= (3 80 + +61) 2%. Explicit normal numbers are known, for instance, 
the Champernowne sequence. Other normal sequences are known, which 
are generalizations of Champernowne’s (e.g., see Rauzy [1976, 1981] or 
Dumont and Thomas [1986]). Some of these constructions make use of 
(s,m)-de Bruijn sequences, which are periodic mappings u from N into 
{0,...,m— 1} of period m*-such that the m° strings of length s 


(u(n),...,u(n+s + 1)), W220; 1..53, 00 =, 


are all different. Algorithms are known for generating infinite de Bruijn 
sequences (see Ivanyi [1988/89] and its references), that is, infinite 
sequences whose initial sections of length m5, considered as loops, af 
(s,m)-de Bruijn sequences for every s. 

The efficiency of these sequences is difficult to evaluate, and is largely 
an open question. One of the main difficulties is to summarize the perfor 


g 
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nc | 
e of a sequence by a single scale, when there are 
discrepancies, one for each finite product nies naturally infinitely 
e. 
glajolet et al. [1988] give an asymptotic result on th 
jp shows that, in a certain sense, random it 


hic seque 
veing the best. quences are not far from 


ma 


iå Rate of Convergence for the Shift Method 


contrast to the case of the law of large numb j 
” ed of convergence that holds for mele Bou RP era 

e pointwise ergodic theorem. Nevertheless, the presently spina A 
ions that integrate very slowly or very quickly by the shift method ae 
constructed by suitable applications of the Rohlin-Halmos lemma. For 
K ommonly met functions a law of iterated logarithm holds with a new 
coefficient different from the standard deviation. We quote here some 
results that are derived from the theorems of Gàl and Koksma [Ben Alaya 
1992] and Heyde and Scott [Bouleau (6)]. | 


4.6.1 Slow Functions and Fast Functions 

Jt is known [Halasz, 1976; Krengel, 1978, 1985] that for every ergodic 
endomorphism 6 of a Polish space (Q,.@,P), and for every sequence 
(a,,) such that Œn —> 0, there is an f in Z? such that 


N-1 
am LS 7208 POSE a.s. 


The existence of such a “slow” function is proved by using the 
Rohlin-Halmos lemma [Cornfeld, Fomin, and Sinai, 1982]; similarly, for 
every € > 0, the lemma furnishes a non constant “fast” function f € L 
for which 


re 1 
1S poo! Ep =0( ge) a.s. 


n=0 
A.6.2 Functions Adapted to a Stopping Time | 
Let (Q,.,P) = ([0,1],.#7(L0; 1]), A). Let (Xn)n20 be the coordinate 
mappings on Q, and define the shift 6 on Q by 


X,09=Xne1 yne N. 
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For n > 0, put 
FE OA bis A 


and let T be a stopping time with respect to the filtration aig 
We are interested in the .7}-measurable random variables. Such ors, | 
tions are very commonly used in simulation problems, and they ty func. | 
be the broadest class of random variables that can be simulated (cf o to 
ter 5). * Chap. 
For F in L7(Q,.4, P) with E(F) = 0, put | 


s(F) = (EF) +2 È EF 0 F2. 
k=1 


The following statements are proved in Ben Alaya [ 1992]. 


Theorem A.6.3 Let T be a.¥-stopping time and assume T has a finite 
moment of order p > 2. If F € L?, E(F) = 0 and F is Z 7 -measurable 


then: 
(i) 
E(F?)(E(T?))/? 
|E(F o 0%.F)| < = pes ol 
(ii) if s(F) > 0, then 
1 N-1 y 

—— ) Fo6"5N(0, 1); | 
s(F)/N 2 | 
(iii) for every € > O, | 
N- | 
2 XF o 6” = (N7? (log N)V*) a.s. | 
N n=0 | 


From the standpoint of computer efficiency, it makes little difference 
whether we shift forward or backward. However, theoretical results a | 
more conveniently stated with the operator tT = 07!. We consider a Pro | 
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ability space (E, E, p) and put 
(Q, 6, P) = (E, &,)™: 


the coordinate mappings are denoted by X, (n € Z) and 7 is defined on Q 
by 
An OX Vne Z. 


FO A OX TO CTD. m<n 
FEF OK Rk Sh) 

A ae = 0(X;,k > m) 
HF = 6(Xp,k € Z) 


Let T be the operator defined on L'(Q,.F 5”, P) by 
Tf=Elf|. Fi] 0%. 


Lemma A.6.4 Forf € L(Q,.7%°,P) with E(f) = 0, the following 
conditions are equivalent: 


(i) ~, T"f remains bounded in L’. 
(ii) $^, T’f converges weakly in L? as NT. 
(iii) DA, T'f converges in L? as N Î œ. 
(iv) There exists g € L’(Q, F 5", P) such that f = (I —T)g. 


We shall say that a function f € L?(Q, F 3”, P) belongs to the Gordin 
class (which we denote by $ ) if f — E(f) satisfies the conditions of the 
above lemma. 

Lemma A.6.5 The Gordin class ¥ is the class of functions f in 
L*(Q,.F 3”, P) which admit a decomposition | 


(*) f-E(f)=gthot'—h 


where 3,h € L2(.F%*) with Elg|-F 11 = 0 and E(h) = 0. Such a decom- 


Position, if it exists, is unique. 
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Ro 
a (6)]. By Using Sy | 
aw of the iterateg “Oren, | 
fe LO JZ + Baie + 
| > 0 > P) be in the G š r, 
of its decomposition (*). Put Ordin Class 


These lemmas i 
are easily proved [B l 
Heyde and Scott [ 1973], one can rain : 


Theorem A.6.6 Let 
let Š, A be the elements 


ang 


N 
Sv= X F-E) or. 
n=0 
Then 


@ limyt (1/VN)||Sy [lo = || Ilo: 
(ii) limyt..(|Sv|/(2N log log N)!/2) = II. 


We refer to Bouleau [(6)] for a Proof and for sufficient ” 
ensuring membership of the Gordin class & ronditions 
In the case of [0, 1}, the membership of the Gord 


expressed in terms of local smoothness. Consider the fol 
phism of [0, 1]° x [0; 17°: 


in class can 
lowing automor. 


(x1, .. 4) (yi, ajs sYs)) 


~> (2), (2%), An] +9). 1((2x,]+y,)), 


This automorphism is sometimes referred to as the “baker’s transforma- 
tion” and it is easily seen that it corresponds to the bilateral Bernoulli shifi 
on the binary expansion of real numbers. 


[La transformation du boulanger] Il ne s’agit pas d’une transformation 
dynamique hamiltonienne, mais, étant donné qu’elle préserve la mesure, on peut 
s’en servir pour illustrer de nombreux aspects de flots hamiltoniens.! [Ilya Pri- 
gogine, 1980] 


Proposition A.6.7 Let f be in L?([0, 1]°,A;) and have Fourier expan- 
sion 


FO)= Ÿ ane 02). 


me ZS 


i ics, but 
miltonian dynamics, 


1 [The baker’s transform] is not a question of a transformation in ha onian flows 


: ‘ amil 
since it is measure-preserving it can be used to illustrate many aspects of h 
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) If there exists a sequence (cn), Cn = 0, Do Cn < œ such that 


(i 


|aznm| S Cnlam| Vme Z°, Vn>0 


then f € F and the function g in its decomposition (*) satisfies 


oo 


allo < IIF - EP Dien 


n=0 


(ii) If there exists & > 0, which satisfies 


Z at (Set) ae 


pe Z 


then f € & and there exists a constant c > 0, independent of f, 


such that 
1/2 
aed Zee E) ) | i 


pe ZS 


es 


In the case of the infinite-dimensional torus [0, 1], membership of the 
Gordin class is clearly related to the size of the derivatives of f with 
respect to the coordinates with large index. A Dirichlet form method (cf. 


Section A.8 below) yields the following result. 


Proposition A.6.8 Put (Q,.4,P) = ([0,1},.2([0,1F),À)%2. Let 
fe P(Q, 7 $°, P) have the following property: for every n € N, the 
map 


[0, 11° 3 x, > f(o,--.-.4ns--J E R 


has (dxo - -dxn_1dxys1 <- -)-almost everywhere a derivative in the sense of 
distributions belonging to L*(dx,). If 
1/2 
wi) Hi 


Sž" 


k=0 
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or if 


D i7(log” NEVVLI7I < 00 


j=2 


for some a > 1, then f € G. 
See Bouleau [(6)] for a proof. 


A7 On the Shift Method in Finite Dimension 
The shift method can also be useful in finite dimensions, and à bit 
can be said in this case. More 
A.7.1 Case of the Shift Method with a Borel Sequence 

Consider f : [0, 1]? — R seen as a function from [0, 112 to R that de 
only on p coordinates. As we have seen, if f € L?(dx) then f belo Fa 
the Gordin class, and this is true for both shifts @ or CRE à 88 to 

We introduce the notation 


Var f = |léll2, 


where 
f-Ef=gthot'-h 


is the Gordin decomposition of f (with respect to t). Var f i 
S : can easil 
be computed by computing ||Sy||z and looking at limy7..(1/N)||Sy|[2; i 


obtain 


Var f = Var f +2 X Cov(f,f o tt), 


k=1 


which of course is equal to 
p-1 
Var f + 2 Ý Cov (ff o 0%). 
k=l 


Thus Var f does 
can be vee s not depend on the direction of the shift. Note that Var f 
re greater or less than the variance Var f 
» H J 1s modifi i i 
dified by a permutation x of its coordinates | 
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fr, . Xp) = fn), . 


ee Xm p)), 


uantities Var fn and Var f are in general diffe 
en ene of applying the simple shift, we apply the do 
if, inste then the quantity Var f will also change. 
shift ©"; e have p.p! methods for computing Ef (the shift 6P is the clas- 
Pease Carlo method), which in general have different coefficients 
sical law of the iterated logarithm. . 
jn us lly we quote the result that, if f is bounded, the shift has an expo- 
a ae of convergence in the sense of Chernov theorem: Y £ > 0,2C < 
ae y> 0 such that 
+ 


rent. Furthermore, 
uble shift @2 or the 


N-1 
‘ S CE m, 
P D > Ef + 


ift wi i istributed Sequences 
of the Shift with p-Uniformly Distri ence 
oe E = (n) of points in [0, 1] is said to be p-uniformly distributed 
A se = (Sn 
if the sequence 


z = Ci Cai tees Entp-1)n>1 


is uni ly distributed on [0, 1P. . . 
j a a Riemann integrable function f : [0,1]? — R by using a 
p-uniformly distributed sequence in the formula 


N 
I : ie n+p-1) 
Ef = lim y DS Gn bre Ep 


in 1 ion, namely, being 
clearly presents the same advantages in RE are a m sae 
able to store partial calculations, as the shift metho i EER A 
sequence. The rate of convergence depends here on 
| ing Van der 
sa . d. sequences can be constructed by the following 

uch p-u.d, § 

Corput criterion. 


i ; f real numbers 
at u = (un) is a Sequence of | 
D es ae i istributed modulo 
T : the ane Unsp — Un Is Sea i 
l. Then the sequence (un) is uniformly distribu 


P 
Key fie 


This result can be proved either by expanding the square ~M 


2 
I 2iTuUy 
l7 2 ‘ 


and estimating the resulting rectangle terms [Rauzy, 1976] or ee 

correlation functions and Bochner’s theorem [Rauzy, 1976; pe a 

We get the following result. 3284) 
Proposition A.7.4 Let P(x) = ao+::-+a,x/ be a polynomial of de 

p = 1 with a, irrational (Weyl polynomial). Then the sequence P(n) 3 

uniformly distributed modulo 1. a 


Proof By Weyl's criterion (Chapter 2, Proposition A.5.3) it suffices to 
show that for every m = (mı, ..., mp) € ZP, m + (0,...,0), the sequence 


ün = m P(n+1)+---+m,P(n+p) 


is uniformly distributed modulo 1. We have 


i=] 


p p 
i= ( > m) apr + > m(Cidp + dyn?! +... 
i=l 


p 
se > m;[Chi ap + CET ap-1 + ++ + ap +- 
I=] 


p 
-+ > mili az +P la; +... +ao]. 


i=] 


The numbers 


a | 
> mi  k=0,1...p-1 | 


i=] 


cannot all vanish. u, is a polynomial in n whose term of highest degree | 
is of the form 
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p 
> ak ak 
mil Cpap, 
i=] 


where k is the smallest integer for which 5? 


i nm 4 O. It i 
rove that when Q isa polynomial of degree > t such hee oo i 
highest degree in Q has irrational coefficient, then Q(n) is u.d. mo ue 


if Q has degree 1, then it is an irrational translation on the t 

was dealt with in Chapter 2, Section C.3. If Q has degree f e 
Yp 2 1, Q(x +p) — (9,09 IS à polynomial of degree d — 1 whose enn of 
highest degree has an irrational coefficient; by the inductive hypothesis 


Q(n + p) — Q(n) is u.d. modulo 1, and so Q(n) is u.d. modulo 1 by the 
lemma. L 
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A.8 Complement: Quasi-every Sequence Is Shift Uniformly 
Distributed 


Consider the probability space ([0, 1],.A((0, 1]), A )8. A sequence @ € 


[0, 1]* is said to be 6-u.d. (also “normal” or “completely u.d.”) if the prob- 
ability measures 


Zl 


N-1 
by de" (w) >, 
n=0 


where 0 is the shift operator, converge narrowly to A®N, It follows easily 
from the pointwise ergodic theorem that APN almost every sequence is 
9-u.d. By analogy with the case ({0, 1},.A({0, 1}), {$8 + 481, })®, it is 
nevertheless suspected that there are more 6-u.d. sequences than “random” 
sequences. Actually, as we will now see, the set of non 6-u.d. sequences 
has null capacity for a large class of capacities on [0, 1]\. 


A8.1 Dirichlet Structures | 

Let (Q,.F%, m) be a probability space. We impose extra structure on it by 
adding a Dirichlet form € with domain D, that is, a symmetric nonnega- 
live definite bilinear form f, g ~> #(f,g) defined on a dense subspace D 
of L*(m), which satisfies the following two conditions: 


(i) & is closed, that is, D is complete with respect to the norm 


IF lho = (hale + ef LY: 
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(ii) iff € D, then f À 1 e D and “Stag 


EFALFAD<E(F,f). 


We obtain what is called a Dirichlet structure Q, Fm, & D 
be a contraction from R to R (i.e., F(0) = 0 and IFœ)-F(y)| < | Ley F 
every x,y in R). It can be proved [Fukushima, 1 aie 


980; Boul XI for 
1991] that if f €e D, then een Bir 


Sch, 
FofeD and &(Fof,Fof)< ETF), 
We shall in addition assume that 


(H) 1e D and &(1,1)=0 | 
(T) Q is a topological space with Borel o-field .F. 


A.8.2 Examples 
(1) Let Q = [0, 1], F=.A({0, 1]),m = À 


D - H!- { ue L*(A,), wis absolutely continuous with | 
1 
derivative uv’ and Í (u’(x))* dx < œ 
0 


l 
E(u, v) = f u(x)v (x) dx. 
0 


Proof of (i): Let (u,) be a Cauchy sequence with respect to the norm 
[[.I!p. Then there exist u in L? and f in L? such that u, — u in L? and 
u, > f in L’. It follows from the inequality 


2 
[u(y) - u(x)f = f “wa < px | (u(t) dt 
x 0 


and the boundedness of the sequence ||u’||, that the sequence (un) 1$ 
equicontinuous. Thus there is a subsequence (Uy, ) that converges uni- 
formly to a continuous function à = u a.e. Let ọ €e YF vanish outside 
J0, 1[. Then 


| 
1 
$ 


f 
i 
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1 1 
fodx=lim | Mode he 
Í, nteoJo ae P Mi dx=—| 


1 

~ oP 

uE dx. 
0 


a is absolutely continuous with deriv 
& is closed. 
The proof of (ii) is left as an exercise. 
(2) Product spaces. Let ([0, 1],.A({0, 11), u, e 
ture. We shall define a new Dirichlet structure i 


Thus ative f, u 


n > U in D and so the 


d) be a Dirichlet struc- 


(Q, F,m, &, D) = ([0,11,.2([0, 1)), p, e, dye 


(Q, .¥, m) = ([0, 1], Z0, 11), ye 


and D to be the set of functions F : [0,1]’ — R such that F e L? (m) 
for every n € N and for almost all fixed ao, @y,.. sa On—1, One,» . the 
function i 


x œ> FO Or Ded 


belongs to d and 
[Zeer dm < ©, 


where e, is the form e acting on the nth coordinate. Finally, for F € D, 
put 


&(F,F)-E [Eeer] (- [Mew in) 


e n=0 


Proposition A.8.3 The form (4, D) is closed. 


quence in D such that || fn—-fll2 — 0, 


Pr | hy se 
oof Let (fn) be a Cauchy ) such that 


Where f € L?(m). There is a subsequence (fn 
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Dili — f || < æ 


and 
EU, he 
k 


It follows that for any i € N, for almost all Wp, ..., Oi-1, Wj4 we h 
> Desay ave 


(a) > IP. (Oo, ce.) O1, X, O1...) (Obs: 


“9 Oj-1,X, i+] <i .))? 


AUX) < 00: 
OD) CS ma Safi, Fae, a 


Assertion (b) follows from the inequalities 
El(e(F, F))™?] < (Ele(F, Fy)? < (&(F, Fy). 


Since the form (e, d) is closed, we deduce that for almost all 


Oo, ---, O1, Wi+1,.--,f(Wo,..., @i-1,., Ojx1...) € d and CF ny -f fmn- 
f) — 0. Therefore by Fatou’s lemma 


ab) UN] =E bs lim nfn) 
i=0 


1=0 
< lim AS nf nz): 
k 


Since (fn) is a Cauchy sequence in D, the last term is finite and f belongs 
to D. Another application of Fatou’s lemma shows that | 


m 


E> ef -fuf -fn)= E Ÿ lim (Fm -fmf m = fn) 


1=0 i=0 | 


co 


i=0 


ie 
= lim CA (fn —f nsf ny — fn), | 
| 


k 
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which can be made < € for large enough k and n because f 
> n 


sequence for Lo. Thus fn > f in D. is a a 


A.8.4 Capacities 
Let (Q,.7, m, 6, D) be a Dirichlet structure. If G is an open set in Q, we 
define ; 


c(G) = inf{|lu||2;u€ Dandu > lg}. 
If A is any subset of Q, the capacity of A is defined by 
c(A) = inf{c(G) : G open, G = A}. 


Lemma A.8.5 Let u,ve D. Then uv v,u Ave D and 
lu v vil + Ilu ave < lull? + lvl. 
Proof We have 


uVVy= (u + v) + |u — vi)e D 
u^ v= }((u+v)- |u- v|)e D 


and 


luv vllo + lluavli5 = 3Cllu + vll + Illu — vil) 


< 4 (|Ju + vilà + |lu — vll2) 


= |lull5 + Ivy. = 


Proposition A.8.6 


(i) VAE F m(A)<c(A)<1, 
(ii) VAi,4 CQ, c(Ai U A2) + C(A; N A2) < c(A1) + c(A2), 
(iii) VA, COA T, C(U nAn) = limyt C(An), 
(iv) VA, CQ ANS > tn: 
Proof Assertion (i) follows directly from the definition. Assertion (ii) 


follows easily from the lemma and the definition. For (iii), we remark that 
if G is an open set and eg is the orthogonal projection of 0 on the closed 
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convex set {u € D;u 2 1Ga.e.} in D, then eg 2 1g and |jeg||? 

If (G,) is an increasing sequence of open sets, then, by the Bs G 

projections in Hilbert space, we have ec, —> @U,G,, and thus ropes GE 

lim, T c(G,). Finally, if (A,) is an increasing sequence of subsets.” va) s 

obtained by using (ii) and approximating each A, with an Open sub (ii is 

in the sense of capacity. Assertion (iv) is a straightforward Consequs n 
NCe 


of (ii) and (iii). 
Proposition A.8.7 Let f be a continuous function in D. Then, fo 
a > 0, "ay 


ctii > aj < HB, 


Proof The set G = {|f| > œ} is open, |f| € D and ied yd ee 


A set of capacity 0 is said to be a c-null set. Relations or convergences 
valid outside a c-null set are said to be valid quasi-everywhere (q.e.). 


Proposition A.8.8 Let (Q, F,m, 6, D) be a Dirichlet structure that 
satisfies (H) and (T). Let (X,) be a sequence in @(Q) N D such that 


o EOX,)=2@ Vn 20 
e (X,— a) is an orthogonal sequence in D 


e sup, lXnllp < 2%. 


Then 


in D and quasi-everywhere. 


Proof Put K = sup, ||Xnll7, + 4°) < ©. 


(a) Since 
ye all? s$, 


D) n=0 


1S, =al 


n=0 


convergence in D is clear. 
(b) We claim that 
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We haye 
“Ase? > Sm? 
M = m2 
m>M D 
1 
=) 
m>M 
Putting 
3 2 
Ana fiim sup a > ef - a Arps 
M21 
we have 


c(A&) < c(Ay)  foranyM 21, 


and thus c(A§) = 0. If 


Sol 
nie 


>of - J ALP, 


p2! 


A= f iim sup 
m 


it follows from the subadditivity of c that c(A) = 0. 
(c) For n> 1, put 


m(n) = sup {m : m <n}. 
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It remains to show that 


1 1 
Sn 


a au OS 


for which it is enough to show that 
l 
z |Sn z Sae 0 ge: 


For € > 0 and N 2 1, put 


n>N 


1 | l 
B= J Lise Snol >e}: | 


Then 
cB se? Y Un - m(n)?)K 


| 
| 
n>N | 
| 


and the argument used in (b) yields 


1 
e( tim sup =|S, = Saol > 0) =0. E 
n n 


We return to the shift operator 6 on [0, 1]\. 


Proposition A.8.9 Let (e, d) be a Dirichlet form on ([0, 1], -%2 ([0, 1]), 
À) such that #([0, 1]) c d. In the Dirichlet structure ([0, 1], 22 ([0, 11), 
l,e, d)®N, the set of sequences œ € [0,1] which are not 0-u.d. has 
capacity 0. 


Proof (a) For k > 1, let f : [0,1] — R be in C! and put 


& =f (Xo,.-->Xk-1) 


r 
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J > 
LS (go8" - Re Lee 
ve D= T| È De - Bie) ogren 
á h=0 m=0 
N-1 
+ Š G- E@))o | 
i=pk 
The first term iS 
k-1 p-l 
p 1 
N p DC = E(g)) o grunt, 
h=0 m=0 | 


and from the previous proposition it follows that this term converges to 
zero q.e. since, for A = 0,...,k — 1, the sequence 


((g — E(g)) 0 80), 


is orthogonal in D. The remaining term converges to zero everywhere 
because g is bounded. 

(b) Now let k run through the natural numbers. There exists a countable 
set D of functions g like that in (a), which is convergence-determining in 
(0, 1]". Since, for any g in D, 


N-1 

1 n 

wy Dye > EG) ge. 
n= 


it follows that 
1 ua @N 
2. DETTE 
N n=0 


Outside a c-null set. 


Comment 


@N that is, 
Cierra 


that a 
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N-1 

1 
lim — X po = APN narrowly, 
yfe N 2 6"(@) 1 y 


The points in G are also called generic points, for the shift accord; 

the measure AN. G has its complement of zero capacity for any Es to 
satisfying the hypotheses of the preceding proposition. This wa | 
itively that this set is strictly bigger than the set of points œ = (y F ty | 
whose coordinates @o, @;,... are “randomly and independently” drawa ) 


To see this more explicitly, let us consider the space of Brownian w 
equipped with the capacity associated with the Ornstein-Uhlenbeck sé a 
group. We represent the Brownian motion by a series with respect ‘ts 

an 


orthonormal basis y, of L?(R;, dx) by 


co t 
B(t) = > £nVn(t) with W(t) = Í Xn(s) ds, 
n=0 0 


where the g, are d-dimensional Gaussian independent reduced random 
variables (cf. Chapter 5, Section F and Bouleau and Hirsch [1991}). 
The Brownian motion is represented on (R, A(R), N(0, 1))®N and there- 
fore also on ([0, 1],-A[0, 1], (A;))®N. The capacity c associated with the 
Ornstein-Uhlenbeck semi-group is of the type found in Proposition A.8.9 


and by a result of Kono [1984], for d < 4, 


clo: lim |B,(@)| < 1} >0. 


[00 


If a sample path is “randomly” drawn it is well known that it is transient 
for d > 3. Thus a point in G that corresponds to a recurrent path for the 
Brownian motion of dimension 3 or 4 is not “random.” 

Explicit sequences in G are known [Rauzy, 1981], which are given by 
an algorithm (and are therefore not random). But no such “deterministic” 
sequence at present is known which is significantly faster than random 
sequences and could play the same role for the shift as low-discrepancy 


sequences play for finite-dimensional integration. 


B REPRESENTATIONS OF STATIONARY FIELDS 


This section is an introduction to the simulation of stationary processes 
and stationary fields. For more details we refer to Azencott and Dacunha- 
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Castelle [1984], Shiryayev [1984], Rosenblatt [1985], Duflo [1990], and 
Preumont [1990]. 

Generally, methods for the statistical identification of Stationary pro- 
cesses OF fields are dealt with in a wealth of literature, which provides 
representation and convergence theorems that are very useful for simula- 
tion. We shall not tackle such methods here. 

Throughout this section, except in Section B.7, random processes and 
fields will be centered (zero mean) and have finite second-order moments. 
Thus we confine ourselves to the L?-theory. Extensions to more general 
situations are often possible but we do not discuss them here. We shall 


deal with complex-valued square integrable functions or random variables, 
unless stated otherwise. 


B.1 Wiener Integral 


Let (F,.A% ) be a measurable space and Ay an algebra of subsets that 
generates F. 


Definition B.1.1 A function Z from Fo to L?(Q,./4,P) is a (cen- 
tered) orthogonal stochastic measure (OSM) if 


(i) for every A in Fo, EZ(A) = 0; 
(ii) for every pair A and B of disjoint sets in Fo, 


E[Z(A)Z(B)] = 0 and Z(AU B)=Z(A)+Z(B); 


(iii) for every sequence (A,,) in Fo With Ap | ©, 


E|Z(A,)|? — 0, n — ©, 


Put w(A) = E|Z(A)|? for A € Fo. The set function po is a finite 
measure and consequently, by Caratheodory’s theorem, can be extended 
to (F,.F ). We denote the resulting measure by u. 

Let % be the algebra of complex-valued elementary .F o-measurable 
functions f 2, la, where the A; are disjoint sets in .Z o. We define 
the random variable 


n 


[rozan -X ojZA)). 


j=l 


à, 
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B.1.2 Exercise | | 
Show that Y is a dense subset of L’(F, 7, u). [Hint: The family @_ A. 


== S . T 
14 € Æ} is a monotone class containing -F o.] 


Lemma B.1.3 The mapping J: f — [f (A)Z(dA) is a linear isometry 
from 4, endowed with the scalar product induced by L*(F,.F H), into 


L7(Q,.4, P). 
Proof Let f,g be in Ff = > 2, Cilan 8 = >; Bj 1z,. 


E | Í f(Z(dd) | g(r za) = » œB;u(4; N B;) 
ij 
= [recu o 


Definition B.1.4 The continuous linear map J from a dense subset 
L of LF, Fu) to L7(Q,.%4,P), admits a unique extension J from 
L (F, Fu) to L7(Q,.4,P), which is a one-to-one homomorphism of 
Hilbert spaces: we call it the Wiener integral or the stochastic integral, 
and still write [f(A)Z(dA). 

So, for f,g in L*(F,.F, 1), we have 


o [fA)Z(dA) € LAQ, 2, P) 
© EffAZ(@À) - 0 
© E[/@Z@NTe@Z D) = fdu. 


B.1.5 Remark | 
The mapping À ~ J 14(A)Z(dA) from F to L?(Q, Æ ,P) defines an OSM 


that extends Z and is also denoted by Z. The measure u is said to be 
associated with Z. 


B.1.6 Remark 

From the condition Z(F) € L7(Q,.4,P) it follows that u is a finite 
measure with total mass E|Z(F)|? < oo, Let (F,.F ) be a measurable space 
and suppose that there are an increasing sequence (C;)yen of .7=measur- 
able sets with F = U,C;, and, for any k € N°, an OSM Z; with associated 
measure ug on Dg = Cx\Cy_;(D; = C1) endowed with the restriction of 
F. Suppose the Z; are mutually orthogonal. Sticking together the finite 
measures Hx, we obtain a o-finite measure u on (F,.7). For any f in 
LT. F, u), put 
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fran => frazan, 
k 


where fris the restriction of f to D}. The mapping 


1 fr Zd) 


-atali i try from L?(F, F, u) i 2 ' 
is still a mea pone 7H) Into LEQ, Æ , P). In particular 
if A in F satisfies H(A) = >", (AN Dx) < œ, then the map Z defined by 


ZA) = SZ AN Dy) 
k 


satisfies the previous additivity and orthogonality properties. We call Z a 
o-finite OSM. 


B.1.7 Example 


Let B and B® be independent standard one-dimensional Brownian 
motions. For t e R and k > 1, put 


B= B lisa) + BOL eo}, 
Cy = |-k, +k] , 
DE= CAC: (Di = Cy). 


For n 2 1 and —-c <a, < bı < --- < an < bn < œ, put 


Z ( D anba) 5 iua 
i=l 


i=] 


The restrictions Z; of Z to D, define mutually orthogonal OSM, and Z 
can be extended to a o-finite OSM with associated measure (dt) = dt. 
For f in LXR, di), we write [ f(A)aBy, in place of [FAZAN 

his construction of a o-finite OSM can be performed for more general 
Processes than Brownian motions. 


Definition B.1.8 Let T be either R, or R. A set of (complex-valued) 
random variables (Y,),-7 defined on (Q,. 4, P) is a process with orthog- 
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(i) E|¥,|? <0, AET: 
(ii) EY,=0, Ae T; 


(iii) P-a.s., the paths À ~> Y, (œ) are right- 


continuous function 
S On T. 
(iv) for every Ai SA. < às 5A, in T, i 


E[(Ya, — Ya,)(Ya, — Ya,)] = 0. 


We deduce easily the L?-right-continuity from (i), (iii), and (iv). Let 


HO) SEY, = 76)? if A>0 
= —-E|Y; — Yo]? if à<0 and T=R. 


The right-continuous nondecreasing function F on T defines a g- 


finite 
measure dF. As above, we put 


| z( U Ja, 2) = Xe, Fa) 


for a; S bı < --- < an <b, in T. Then Z defines a o-finite OSM with asso- 


ciated G-finite measure dF, and we write | f(A)dY,, in place of [f(A)Z(dA) 
for any f in L*(T, dF). 


B.1.9 Example 


Let Z be a o-finite OSM in (R47,.B(R2)) whose associated measure | is 
a Radon measure. For x in R°, put 


d d 
Eez ( [Lo Ax;,0V ) IT sign (x;). 


[=] =] 


Then (Ÿ,),.R4 is said to be an orthogonal random field (ORF). The mul- 
tidimensional increments of Y, corresponding to disjoint d-dimensional 
intervals are othogonal random variables. 


URE eee Orne 
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B.1.10 Example 7 


The notion of OSM is often more access: 
be an orthonormal basis in L?(R?, dx) and 
sequence in L*(Q,.4,P). For f in L2(R4 


ble than that of ORF. Let (Qn) 
(Gn) be a centered orthonormal 
, 4X), put 


J(f) = > <f, Pa > Gy. 


Then J is a linear isometry, and Z(A) = J(11) define i 
(R? x PR). If the A are real-valued and oe are fee anes 
dard normal variables, then J(f) is real Gaussian for any real-valued func- 
tion f; in fact, the associated ORF has independent increments (Brownian 
sheet). 

To handle complex Gaussian processes we must make clear what we 
mean by a complex Gaussian variable. 


Definition B.1.11 A complex random variable G = G, + iG, is com- 
plex Gaussian if G; and G, are independent (centered) real Gaussian vari- 
ables with same variance 67. A vector X = (X,,...,Xx) is complex Gauss- 
ian if, for every Ay,...,A,% in C, the random variable A,X, +---+A,Xzx is 
complex Gaussian. 

In Example B.1.10, if the random variables G, are independent com- 
plex Gaussian with o? = 1 and (@,) is an orthonormal basis of the com- 
plex space L7(Q,.4, P), then J(f ) is complex Gaussian for every f in the 
complex space L?(R4, dx). 


B.2 Spectral Representation of Stationary (Wide Sense) Fields 


Definition B.2.1 A (centered) random field X is a measurable map 
from R to L2(Q,.4, P). It is called stationary in the wide sense if for 


x,y in R, 


E[X x4yX y] = Cov (X x+y» Xy) 


does not depend on y. We shall write Ky for the function 


K x(x) = Cov (X ray, Xy) 


Which we call the covariance function of X. 
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B.2.2 Example 
Let Ay ae 


Then X is Stationary in the wide sense with covariance function 


We define an OSM on BR?) 


with associated measure u = >; MECS we check the relations 


This construction can easily be extended to the infinite case 


provided 5.7, ||&II3 < ce. 


More generally, the following statement is easily checked. 


Proposition B.2.3 Let Z be an orthogonal stochastic measure defined 
on (R4,.Z?(R%)). Then the random field 


.., A, be n distinct points in R? and &,, 
(centered) random variables in L?(Q, Æ, P). Put 
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Ske 


sees A pairwise Orthogo 
Nal 


Xy = de X € RS. 
j=l 
k . 
Kx(x) = eo 18 113. 
jl 


Z(A) = >) Eð (4) 


jel 


X,= | el AX)7 (di) 


Kx(x) = Í eda). 


x, = D ling 
j=l 


X,- | eA) 7 (di) 
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is stationary in the wide sense and Z2 


-continuous. 
The converse property is a striking fact. 


Theorem B.2.4 Let (X xyepd be a Stationary random field that is 
L?-continuous in the wide sense. There exists a unique OSM Zx on 
( R, BRI) with associated finite measure uy such that 

X, = | ehy (AA) 
Kx(x) = fuxan 
for every x in RS. 


Proof ‘The covariance function is a nonnegative definite function: 


2 0, 
2 


k 2 
) Kx (x; — xj); = | > ox, 
ij=l 


for all finite sequences x;,...,x, in R? and ou 2 Se in C; it is also 

continuous. It follows by Bochner’s theorem that it is the Fourier transform 
° $ d : 

of a finite measure ux in (Rf, (RÐ). The mapping 


n ọ n \ 
` S px; 

OX x; ~> r NT 
j=l j=l 


is a well-defined isometry from a linear subspace of L (Q, 4, P) ee 
linear subspace of L2(R4, Lx). It can be uniquely extended to the tee i 
closed subspaces. On the left- hand side, this subspace iS aie tS L in 
the right-hand side it is the whole space es (RI, ux). If, for f € | 

we put 


f (A)Zx (da) = p '(P), 


then we have 
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D pith), | 
and, for ex(A) i op (ex) = Xx. | 


s an OSM with associated measure uy, and fs + 
It follows ES 
m The measure [Ly 1S ee pu iat a easure of . 

Its Fer ane one, is called the spectra 


i ields 
Transformations of SAR F eee 

see that is stationary 1n the wide = a g 
Given a field X = (Xx) ye R? (Y)yepa by various linear ee (deriva. 
other stationary fields Y n h commute with translations Fi € param. 
tions, convolutions, ne : ons are not, however, defined for a ey 
T ie ae of distributions and generalize unctions 
ae ee a We shall confine ourselves to the L -theory, 
[I.M. Gelfand, N.Y. . 

*-continuous stationary random field 

Definition B.3.1 Let X be an L*-co 


with spectral representation 


ee | eZ, (dh) 


and covariance function. 


Gis Í ely (dd), 


m 
A linear transformation T defined on a vector space of eee 
fields that includes X is said to be X-admissible if there exists 
L?(R?, wy) such that 


(+) Ti | CO) ZAR). xe Rf, 


| ral 
The function C is Called the transfer function, or the gain, or the spect 
characteristic of the Operation. 
We easily get the following result. 


Proposition B.3,2 The random field Y = T(X) defined by (*) is Gute 
sense) Stationary with covariance function 
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Kra) = | ACW 
and the random fields X and Y are stationarily correlated: 
ELY x+yX,y] = f ecaa. 


This last function of x is called the cross covariance function of Y and X 
and is denoted by Kyx. 


B.3.3 Example 

(Band-pass filter.) For d = 1, take C(A) = 1ja,.a,)(A) for -œ < Ay < À < 
400, The previous proposition shows that the spectral measure of Y is equal 
to the restriction of the spectral measure of X to the interval [A,,A.]: so 
we have a band-pass filter that keeps only the waveband [A,, A]. 


B.3.4 Example 
(Linear combinations of parameter translations.) Let 


Y,= > oh xe Ri 


for o,...,04 in C and x;,...,xg in R%. The transfer function of the trans- 
formation taking X to Y is 


k 
C(A) = ps je), 


J=! 


B.3.5 Example 

(Moving averages.) Let X be an L?-continuous (wide sense) stationary 
random field with associated OSM Zy, and let v be a finite measure in 
R°. Put 


C(A) = [evan ke RZ. 


From Fubini’s theorem for Wiener integrals (see Exercise B.3.6), it follows 
that 
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$ 


| ( | AT) dv(y) = Í el ( | AT) Zx(dn) 


The bounded function C is the transfer function of the operation that tak 
X to the stationary random field 6 


Vac fea y). 
This integral is a Bochner integral in the Hilbert space LO, A, P), 


B.3.6 Exercise 
(Fubini’s theorem.) Let Z be an OSM with associated finite measure u 


Let v be a finite measure in (R“,.A(R%)) and ọ € L'(R x RI, y @ v) 
Show that | | 


Í | | OA, vza) v(dx) = Ï( | pA, ava») Z(d)). 


[Hint: First show this relation holds for @(A, x) = 8(A)y(x). Then approx- 
imate in L*(u @ v) by finite sums 


Onyx) = D OAD YE). 
k 


Show that 


| P, (A, X)V(dx) > | o(A,x)vidx) in L(R,u) 
and deduce that in L7(Q,.4, P), 


J (J Pa svan) Z(dh) > Í | | pA, vado) Z(dd). 


Then show that in L7(Q,..4, P), 


J (J Pal, vza») v(dx) > Í | | (A, ozan) vax. 


B.3.7 Example 
(Differentiation.) Let X be an L?-continuous stationary random field such 
that its spectral measure uy satisfies the condition 
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| |A] Ux (dA) < co, 
for k = 1,...,d, the X-admissible transformation with transfer function 
CrQ) = ig 


maps the stationary process X to its derivative process (0X ,/ ORE) pas Le 
in addition, 


| |A\*px (da) < %, 


then X is continously twice differentiable with respect to the L?-norm. For 


d d 

ax dx 

Y,=aX,+ D be + ee 
i j=l 1 OX; K Ferda 


k,l=1 


then Y has the spectral representation 


Y,= ae (a + ilà, b) — (A, cA))Zx (dA). 


B.4 Second-Order Linear Differential Equations 
Suppose first that d = 1. 
Proposition B.4.1 Let X be an L*-continuous stationary process on 


R and let a;, a be complex numbers that are not purely imaginary. Then 
the equation 


(*) Y; = (a; + aY; + a\a2Y t = Xt 


has a unique continuously twice L2-differentiable stationary solution Y. 


Proof Suppose that the process Y is a solution with spectral represen- 
tation 
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OCES Spg 
Y, = ITS te R. 
It follows from the hypotheses that 
Í N'y (dÀ) < ce 

and 

Ya Í iñe™Zy(dÀ) 

Y? = f (-A)e™Zy (dd). 
Thus we have 

X, = | (aay = (ai + an - Zyd: 


from the uniqueness of the spectral representation, it follows that 


Zx (dX) = (aia — (iA)(ay + a2) — A*)Zy (dÀ). 


1 
(id)? — (iA)(ay + a2) + ajaz 


1 1 1 f 
ai- a |ià-a; i-a Pare 
1 


C(À) = 


(ik— a} if a; = a2. 


Then Zy(dd) = C(A)Zx(dA). Conversely, as C € L?(R, ux), this relation 
defines an OSM Zy and it is easy to check that 


Vee | eZ (dd) 
O 


is the solution to the problem. 


To see what can happen if a; or az is purely imaginary, let us consider 
the equation 
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(*) Y,-ioY,=X, teR 
where & € R and Y is continuously L?-differentiable. If Y is stationary, 
then 

(A — a) uy (dA) = px (dar), 
so we must have 


1 
(+**) |a a < ©, 


If this condition is fulfilled, the general stationary solution of (**) is given 
by 


i giMt m 
neg +| aa). 


where & is any (centered) random variable in L*(Q,.%~,P) which is 
orthogonal to the subspace L£. If, more generally, we do not require the 
condition (***) to be satisfied, then any process 


t 
Y, = Yoe” + “| e xX ds 
0 
iqt 


p io — 
= Yge (5 A Wem 


where Yo € L2(Q,.%4,P), is a solution of (**), although not in general a 
Stationary solution. 


B.4.2 Example 


The response of a mechanical system with one degree of freedom to the 
force X, is governed by the equation 


i + 2600Y; + OY = Xis 


where o% > 0 and 0 < & < 1. The transfer function C taking X to Y is 


| 


C(A) (iA)? + ZEWoiA + OH 
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Notice that the formula j 


Yrs | e C(A)Zx (da) 


defines a continuously L?-differentiable Stationary process, not only : 

of finite spectral measure Hx but also when uy is Lebesgue ae ie 
this case the generalized process X is called white noise and Z oul 
finite OSM. The spectral measure of Y is pica 


dh 


dx) = —— — 
ey (w5 — A2)? + 4E2@p 2 


and its covariance function 


re-i T ] 
KO =a C 1 — &?|z| 
0 


Le > 
AE sin Oo V 1 — Ë "| ; 


We handle partial differential equations for stationary random fields in a 
similar way. For instance, the equation 


aY + i(b, VY) — (VY,cVY) =X 


has a continuously twice L?-differentiable stationary solution Y for any 
L?-continuous stationary field X, provided that the matrix c is symmetric 
positive definite and the function 


a + ilb, AY — (A, cr) 


is non-zero over the whole space IR“. Otherwise, we need additional hy- 
potheses on the spectral measure of X for the equation to have a stationary 
solution. 


B.5 Multidimensional Processes and Fields 


In practice, it is often necessary to make use of multidimensional processes 
and fields. The theory is very similar, and we shall only give the notation. 


Definition B.5.1 An n-dimensional (wide sense) stationary field X = 
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is a collectio i irwi 
(Xi )rerd İS on n of n stationary, pairwise stationarily connected 
fields, SAY, A xs... Axe 


This means that the matrix EX tsy Xy] does not depend on y in R4. It 
is called the covariance function Kx of the multidimensional field X 


Definition B.5.2 A so-called vector-valued (centered) orthogonal sto- 
chastic measure (VOSM) Z on a measurable space (F,.F) is a mapping 
from Z to (L*(Q,.4,P))" such that 


(i) for every A in Z; EZ(A) = 0; 
(ii) for every pair of disjoint sets A and B in Z, 


E[Z(A4) Z(B)]=0 and Z(AUB)=Z(A)+Z(B); 
(iii) for every sequence (A,,) in F with A, | Ø, 


EIZA» 30, n>». 


A matrix measure 
u(A) = E[Z(A)‘Z(A)], AE F, 


is Hermitian and nonnegative definite. 


The trace T = > Ujj of u is a nonnegative measure in R? and the 
space L (Rd, t) is denoted by L?(Rf, u). 
A stochastic integral may be defined as in Definition B.1.4. If X is a 


multidimensional L? continuous stationary random field, then there exists 
a unique VOSM Zy on (R4,.Z2(R%)) such that 


ee exc 


the matrix measure py associated Zy is called the spectral measure of X, 
and 


K x(x) = [exc 
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B.6 Real-Valued Fields 


Proposition B.6.1 A (wide sense) stationary random field x 


valued iff the associated OSM Zy satisfies 1S Teal. 


| 
| 


Zx(—A) = Zx(A) 


1 


for every À € PR). In this case, Ux(—A) = Ux(A). 


Proof Assume first that Zx(—A) = Zx(A) for every A in .Æ(R°%), Then 


x (—A) = E[Zx(-4)Zx(-4)] = Mx(A). 


For every elementary f = >)", @;14,, 


Í fOZx(dd) = YZ (Aj) = J T;Zx(-4;) = Í FCM Zx(dd) 
j=1 j=l 


and by density this equality holds for every f in L?(R%, ux). Thus we have 


ee Í ei AZ x (4h) = Í eZ (dÀ) = Xx. 
Conversely, suppose that X is real-valued. Then for every x in R, 
Kx(-x) = Kx(x) 


and it follows from the uniqueness of the spectral measure that, for every 
A in A(R‘), 


[x (A) = Hx(—A). 


If fA) = ZX oye, then 


Í F(A)Zx (dÀ) = ` OX x; = b? OX x, = Í f(-NZx (da), 
j=l jal 


g 


\ À 
| 
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SA aking limits this equality extends to every f in LR Le): Ia 
Zx(A)=Zx(-A), Ae BRİ. O 


Let us return to the real Brownian motion B in Example B.1.7. If f € 
LR dh) is real-valued, the stationary process 


Xi [eroas, 


: not real-valued, its real part is not stationary, and the random variables 
X, are not complex Gaussian in the sense of Definition B.1.11. In fact, it 
is more convenient to deal with a complex-valued Brownian motion. 


Definition B.6.2 A complex-valued process W on R, is a complex 
standard Brownian motion on R, if there exist two independent real stan- 
dard Brownian motions B® and B® such that 


1 


W = 
/2 


(BP +iB®), A20. 


Such a process W is a complex-valued POI. For any f in L?(R,, dA), 
Y, = | foam 


is a complex stationary process. Finally, if we put 


Wr = Walpo} — W-al reo}, 


then it is easily checked that W is a POI on R and, for any real even f in 
LR, dh), 


X,= [era 


'S à real Stationary process with spectral (density fA). 
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B.6.3 Exercis 
e 
Check that 
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7p! Re(z i 
Ee’'R ( = exp {El at 
4 
for any z in C and À in R,. 


B.6.4 Exercise 


C . 
PF local martingales.) If H is any analytic function 
ee r = Re[H(W;)] and Vi = Im[H(W,)] are local m how 
} = (Vh and (U,V), = 0. [Hint: Combine Ito’s f “a 


Cauchy—Riemann equations to find that 4H(W)) = HWaawy D the 


in C,s 
artingales 


B.7 Strictly Stationary Fields 


| Definition B.7.1 | A (centered) random field X = (Xx),epa is Station 

in the strict sense if its finite-dimensional distributions are invariant inte 
e 

all shifts of parameter; that is, (Xxj+y,---,Xx,4+y) has the same distribution 


as (Xx,,...,Xx,) for every n> 1 and Xi Vin Re. 


| Definition B.7.2 A real-valued random field (Xx)repd is a real Gaus- 
sian field if for all x1,...,x, in Re. any real linear combination of 
Xx,,-++,Xx, is a real Gaussian variable. 


Recall that we consider only centered random fields and variables, so 
the finite-dimensional distributions of a real Gaussian field are completely 
determined by the correlation function E[X +X]. Thus, if a real Gaussian 
field is stationary in the wide sense, it is also stationary in the strict sense. 
For instance, the process X in B.6.2 is real Gaussian stationary. 


Definition B.7.3 In the same way, a complex-valued random field 


. | ; d 
(Xx )yepd is a complex Gaussian field if for all xı,...,Xn In R, every 
complex linear combination of X,,,...,X%, iS a complex Gaussian vart- 
able. 


A complex wide sense stationary Gaussian field is also stationary 11 
the strict sense. The process Y in Definition B.6.2 is complex Gaussian 
stationary. 

We obtain more general examples of processes t 
strict sense by introducing processes with indepen 
are more general than Brownian motion. 


hat are stationary in the 
dent increments. They 


| 
| 
| 
| 
| 
| 
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or the moment we Shall adopt the foll 


Owing definition which is 
. - 4 S ; 
ent for our purpose. A wider definit es 


a lon will be given later in Sec- 


tion 


pefinition B.7.4 Let T be R, or R. A collectio 


- (FahreT is a (square integrable centered) proces 
y dent increments (PSII) if | 


n of random variables 
S with Stationary inde- 


per 
() Elva’ <e, Ae T; 
(i) E=0 Ae T; | | 
(iii) P-a.s., the paths À ~> Y;(w) are right-continuous on T; 
(iv) for lo < ee < Àn in T, the increments Ya; 
are mutually independent; 
(v) the distribution of Y4,,, — Yu does not depend on 4, 


— Viasers Yay ii 


for every À in 


It is straightforward to check that a PSII is a POI, and the complex 


standard Brownian motion in Definition B.6.2 is a PSII. But the POI W 
is not a PSII on R. | 


Definition B.7.5 A real-valued process S = (S,)< R, 18 a subordinator 
if 


(i) So = 0 and a.s. S has nondecreasing right-continuous paths; 
(ii) for ào < =- <A, in R;, the increments Sig, NS 
are mutually independent; 
(iii) for A in R,, the distribution of Sh+u — Su does not depend on ų. 


B.7.6 Remark sae 2 
This terminology is due to the following property, which is ia el 
nation in the Bochner sense (see Section D.7 below for re : A : ne 
(Xi)z be a Markov process with state space (E, & ) an pa pA 
group (P;):>0. For every bounded £ -measurable function F; A 


E[f(X,)|X,,u <8] Peda P-a.s. 


| e process X. 

(see Chapter 4). Let S be a subordinator AT ss process, 
en it can be proved that the process (Xs, )x20 1$ | 

With transition semi-group 
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On = | Panas, 
0 


where u} is the distribution of Sj. 


B.7.7 Exercise 
Verify the assertion of Remark B.7.6. 


Since for every p 2 0 

Ble PS] = Eje PJE eP], 
there exists a continuous nondecreasing function @ : [0, °[— [0, -[ with 
(0) = O such that 

Efe PS] = grep) p,À > 0. 
The function © is a Bernstein function (see Section D.3 below): for k > 1, 
and p > 0, we have (—1)*(d*p/dp") < 0. In this section, we consider only 
the case (0) < œ, which is equivalent to assuming ES; < œ. From the 


Lévy-Hincin formula (see Section D.3 below), it follows that there exists 
m > 0 and a o-finite measure u on JO, ~[ satisfying JF xu(dx) < œ such 


that 
(D) = mp + Í (1 - e ?*)u(dx). 
0 


Proposition B.7.8 Let (W:):0 be a complex standard Brownian 
motion and (S,)a>0 a subordinator independent of W, with (0) < =. 
Then Y} = Ws, is a PSII and, for any f € L7(R,, da), 


X, = fertoan, te R 


is a strictly stationary process. 


Proof (a) The paths of Y are right-continuous because those of $ are 
right-continuous and those of W are continuous. We have 


ElYal? = 4E[Bs’) + (BS)?] = ES) = AQ’ (0) < =. 
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EY, = 0. Forn > Lips: j 
e ae iera e D Cand OE Tog Se jura 


E LD Re DRE si no} 
a 
ss iE ($ rewa -et o 
D mu, - BS. JH 


n 


j=l 


-JJe f-o; - ve D wiy 


j=l 


n n 
= I] E| exp < iRe Sy, = mo 
jel F1 


Ti 
M 


It follows that the process Y has independent increments and the distribu- 


tion of Yan — Yy depends only on À. 
(b) For any elementary function g(A) = >: ji aA), we have 


seen that 


8 exp fire [car }] = exp Ya he vo( +) 


j=! 


-apf - f o( DE) a} 


Note that, for h and k in L?(R+, dA), 
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E | | h(À)d YA — Í k(A)d Y E (g0)? Í |h — k|?dh. 
0 


For any g in L*(IR,, dA), take a sequence of elementary functions ( 
converges to g in L*(R,,dA). We find that 8n) that 


E | exp {ire fear, À] = tim Etexp Re fean 
= lim exp f- f o( 2) ar 
n 0 4 
co 2 
= exp -f of IEO ) anh 
0 4 


The last convergence is a straightforward consequence of the inequalities 
0 < p’(p) < p’(0) < œ for p > 0. For v1,...,v, in C and ¢),...,t, in R, 


IE] exp iRe X 7X; 
j=l 
=| exp Re | ( > se) fA)dY 
jl 
2 


= EXD -f v (Ea Jajor dÀ >, 


j=l 


from which it follows that the finite-dimensional distributions of X are 
invariant under parameter shifts. 


B.7.9 Remark 
To get a PSII over T = R instead of T = R,, we take two independent 


copies Y Y and Y a over R, and define 


(d (2 
Yn = Yi Lo — Y Ch- Laco: 
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R 


alternative Way to obtain strict sense station 
es averages With respect to a PSII. For in 
LR dk), it is easy, following the 
1 2 


that 


ary processes is to define 
Stance, given a function F 
proof of Proposition B.7.8, to check 


XxX; = |se- Ad Y> 


is a strict sense stationary process. We will draw the same conclusion for 
is a 
another class of PSII. 


Let Y® and Y be two independent copies of a real-valued PSII on 
R et at 0 (i.e., Yo” = Yé” = 0). Their exponent function w is defined 
+ 
by 


Efe"" ] = exp —Ay(u), A=0, we R: 


by the Lévy—Hincin formula, there exists 6 > 0 and a o-finite measure v 
a R* = R\{0} such that 


| b*v(db) < œ% 
R* 


and 


Loi? = iub | 1 _ iybyv(db). 
y(u) = +50°u a | 


It is well known that every PSII has a version whose paths a.s. have limits 
is 
from the left, so we may put 


(2) | ke KR. 
Tr = Y {1 1120) = Yen lAo) 


This is a real PSII on the whole line. 


1 e e The 


X,- [rc _ hd 


is stationary in the strict sense. 


I y rt in | Xe 
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It 1s easy to check that 


ESSkg | 


j | SES Í AAY | = exp — | Wig(A))dr, | 


| 
As in the proof of Proposition B.7.8, this formula extends to any re | 
tion g in L?(R, dÀ). In fact, if g, — g in L?(R, dÀ), then al fung, | 


Í End Y), > Í gd Ya in L? (Q,. 2, P) | 
and 


| Í (W(gn(A)) — viel 
ae 2" LE ibgn _ ibg _ : x 
SF flen- g ldà+ le ES — ib(gn — g)|dv(b)d), 
R* 
Since there exists c > 0 such that 
fe sn — e8 _ ib(g, — g)| 


< | ef sn-8) — | — ib(8n z g)| 7 (e's = 1)b(g, = g)| 
< cb’ | 2, = g|? + b°|el En T gl, 


it follows that 


exp — | W(gn(A))dA — exp — [rec 


and we are done. For —co < tı < --- < t < œ and u1,...,un in R, we find 
that 


Efexpi@iXs + +++ +UnX;,)] = E api fY, ujf (t; — Ad Yn 
Ji 


= ep ME uf (tj - n)a 


J=! 


This last expression is clearly invariant under time translation. 


IONS OF STATIONARY FIELDS 
RESENTAT 
REP 167 


B.8 Space and Time Discretizations 


A stationary process may be discretized with respect to frequency or time 
(space in the case of a random field). We first deal with discretization with 
respect to time. 


(a) Simulating Stationary Processes by Stationary Sequences 
We restate briefly the main features of the discrete case. 


Definition B.8.1 A complex random sequence X = (X»)nez is sta- 
tionary in the wide sense if 


(i) Xn € P(Q, £, P) neZ 
(ii) EX, = 0 neZ 
(iii) E[Xn+mXm] does not depend on m. 


By Herglotz’s lemma, we may associate with the covariance function 


Kx(n) = E[X minX ml ne Z 


a finite measure uy on (] — m, +7], Ø — x, +x])) such that for every n 
in Z, 


mas | ey (dN), 


The measure [lx is called the spectral measure of X, its density, if any, 
the spectral density of X. 


Theorem B.8.2 Let X = (Xn)nez be a wide sense stationary sequence 


with spectral measure x. There exists a unique OSM Z in (] - 7, +7], 
A] — n, +7])) with associated measure py such that for every n in Z, 


oe fzaun 


We consider the OSM Z as defined on (R, 2R) and carried by 
|~n, +7], and put 


X,= [ezan te R. 


The stationary process (X;)re x extends the stationary sequence (Xn)nez and 
is known as soon as the sequence (X}) is known. Actually, in the general 
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| s 
case of a spectral measure with compact support, the following samp| 
Plin 
g 


f 
statement holds. | 


Theorem B.8.3 Let X = (X:)rer be an L?-continuous wide Sense 
tionary process. Suppose that the spectral measure Hx is absolutely be | 
tinuous with bounded density and vanishes outside the interval ] - } +e 


Then, in L7(Q,.4, P), 


+N : 
sin(tk — nm) 
= li —, te R. 
Xı e ue un = 


ee 
Proof Let Zx be the OSM associated with X. Then 


X= | MZ, (AD), te R. 


In the Hilbert space L?(] — k, +k], dà), the Fourier representation of the 


function 


| 
| 
| 


E; (A) 2 e™ 


with respect to the orthonormal basis 


ee ne Z, 


1 
gn(A) = VE 


is given by 


So 


nez 
> Var 
és ame, 


As the density of x is bounded and vanishes outside ]~k, +k], this expañ 
sion holds still in L?(R, x). It follows from the isometry property that 
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| — sin (tk ) 
X, = lim ee A A ae 
N Too haart tk—nn ge Zx(da) 
+N R 
; sin(tk — nt) 
= lim OR. BA 
Ni lee > tk— nn X rx /k 
n=—N 
in LQ, P). O 


Jf now the stationary process X = (X;) has a spectral measure with 
pounded density but no compact support, we put, for k € N*, 


Xs eg Ly K,+k)(A)Zx (dA) 


and obtain 


k) ~ (k) sin sin (tk — nt) == nT) 
x! = lim >) e 


The quadratic error is given by 
E|X; — XP = px(R\] - k, HI). 


(b) Approximate Solutions of Differential Equations 
Let U be a white noise (see Example B.4.2) with associated ©-finite OSM 


Z. Consider the pth-order (p = 1) linear differential equation 


dX; dPxX, 
(*) aX, + a) —— ketay rc 


It can be shown that if the polynomial P(x) = do + ax +-+: + apx? does 
not vanish on the imaginary axis, the equation (*) has a unique stationary 


solution, which is given by 


B.8.4 Exercise 
Verify the above assertion. 
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s | 


Our aim is to find some approximation of X, that can be simul 
ated 


Our method is to replace derivatives by difference quotients, à 
derivatives by higher difference quotients: 


nd higher | 
dX, 1 | 
AF g (X; — Xh) | 
: | 
. | 
dPX, 1 


ag ge Or pha CCX, 
Hess + (IP Xe). 


The white noise is replaced by a discrete time white noise with Variance 
27/h. Indeed, for A > 0, we put 


W, = 57 [eas mem ODZ CR ne Z. 


The sequence (W,),_7 is centered orthonormal and the differential equa- 
tion is replaced by the sequence of equations 


~ 


1 ~ 1 1 1 ~ 
KP ( 3) ++ (DIX (bn ink po( 3) a pe X (n-ph@p 


= [2m 


=) z We ne Z. 


The sequence (Xs is described by an autoregressive model of order 
q, and 


(**) Xo, = boW n = E ET A S bX noh 


with 
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he recurrence formula (**) to be useful i 
Fort 


out the p first arbitrary values Xo,. 
wipe 


N simulation, it must rapidly 
eee, à (p-1yh- Therefore the linear 


f (n) =-bf(n-1)-----— bof (n — p) 
be stable. Looking for solutions f (n) = u”,u € C, we obtain by 
has to 


Taylor’s formula 
Le Lis 0 
d h uh) ` 


ie i <1 
ow that the roots of P lie in the half-plane Rez <0. ae 
a ae FES is stable. The wide sense Stationary sequence (X,})1ez 
er | | 
eee process with spectral density 
is a 


1 
= ag lÂ), 
TO= sO HP 


where 
Ole) = (L+ biz t+ bpzP). 
0 
Note that 
-iÀ 
1 1 h p( = 
a ame p(¢- qa) > 2x h 

"A 
and so à ] - lt): 

POSE Te 


sr) 


The covariance function is 
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E[X ts] = Í e™ (A) dr 
= Í eho (A) dA, 
where 


ga) = | /1oe@ay 2 Li-x/h+n/H (À). 
=g 
sr as 


As h — 0, gn(A) converges uniformly and in L'-norm to the Spectral den 
sity of X, which we have seen to be | 


1 
8) = ait 


So the process 


l}-r/h,+7/h] (A)Z(dA) 


approximates process X,; precise estimates can be derived from 


Š ; 1 1 
X, = X, ama Jel F A [= ny nm) 
ie 


B.9 Spectrum Discretization 


We now consider stationary processes with discrete spectra, which are in 
some sense approximations of a given process X. 

For instance, given a small h > O and a sequence of (centered) 
orthonormal random variables (N,,), we put 


X, = DAC — sh, (n + DAJ) Zeh" 


neZ 
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Then 


bx = D xd — D, (+ aps, 


nez 


d as h > 0, ug converges narrowly to uy and Ky to K i ; 
en compact subset of R. X x uniformly in 


This kind of discretization preserves the order of magnitude of the decay 
at infinity of the spectral measure, and thus the local regularity (or irreg- 
ularity) of paths is not altered. In the L?-sense, the C% -differentiability is 
the same for X and X because it depends only on the finiteness of the 


integral 
futuran. 


As for the a.s. continuity of paths, it follows from Kolmogorov’s theorem 
that a continuous version for (X;) exists if there exist positive constants 
a, B,€ such that 


E|X, - X,|" < Blz- s|'* 


for all t,s € R. Taking © = 2, we get 


EX — Xl” 7 | je 1| ux(dà) 


and thus an L?-continuously differentiable process has a continuous ver- 
sion. 

B.9.1 Exercise i 
Let h > 0, (Nn)nez be a sequence of orthonormal random variables, and 
(an)nez be a sequence of complex numbers such that there exists & > 2 
with >, lan| |n|% < œ. Show that the series >, N dye" converges a.s. 
in Z((2x/h)T). (Hint: Estimate P(|Nx| > V [kl log |k|) and use the Borel- 
Cantelli lemma.] 


B.10 Simulating Stationary Processes with Stationary Germs 


The simulation of stationary processes is often used to test the quality of 
Indentification, filtering, or forecasting algorithms. It 1s therefore important 
lor the simulated approximate process to have suitable ergodic properties. 
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We assume that X is a real Gaussian process: 


X; = f v2c0s à dB + | V2sindafanane, 
0 0 


where f is an even smooth function, and B® and B® are two indepen. 
dent Brownian motions. Its spectral density is (f (A))*. The following ideas 
would apply equally well, mutatis mutandis, to Gaussian stationary fields 


B.10.1 Zeroth-Order Germs 
Let (Un)nez be a sequence of independent standard normal variables. For 


/>0, put 
| tl 
sm | 2 = nm) 
Vn) = —; , neZ 
2 — AT 
m 
(*) X= D Uni). 


As we have seen in the proof of Theorem B.8.3, the sequence of functions 


1 . 
(A) ee a ezimna/l 
Lu 


is an orthonormal basis in H = L?(] — 1/2,+1/2[, dA), and the Fourier 
coefficients of the function 


e;(A) és eit 
are given by 
(er, f n) = VI Wn (t). 


It follows that, for s,te R, 
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0 < 
EX x) = X vat + SWn(s) 


1 
= I (Crs, es) 


1 2 . 
-3l eM dÀ. 


The process X © is therefore stationary and its spectral density is 
À 1 
r(A) = T l-1214 (À). 


Conversely, if r is the spectral density of a real Gaussian station 
i - a 
X, the representation (*) holds with U, = Xonnyji,n € Z. ry process 
For 0e R anda € R, let us put 


+00 


i | 
XLO = Va V1 Wn (OU, cos Ot + VA sin 0t), 


— C9 


where (Un) and (V,) are two independent sequences of independent stan- 
dard normal variables. We have 


EX LOLO] = 2471 uit + S)Wn(s) cos Ot 


po. 
— 2a’ cos af aldh, 
—(//2) 


which, for |8| > //2, can be written as 


_0+#//2 | 0#1/2 
a J e dh + | chan ; 
-9-1/2 9-1/2 


and for 6 = 0 can be written as 
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R 
1/2 y | 
2 itr 4 | 
2g" e "dÀ. | 


—({/2) 


We conclude that for k € Z, the process X 4/0) 


is i 
| Stationary with 
density Peet 


a(Li-jej192,- el À) + 1 H+1k|1-1/2,4|&|141/2((A)) 


for k 4 0, and 
24° 15-2172 (À) 


for k = 0. 


B.10.2 First-Order Germs 
Again, let (U,,) and (V,) be two independent sequences 
standard normal variables. For ao, ay € R and 0 € R, put . 


XE) -vV S [Una cos Of (t) + a; sin Ory (0) 
+ V, (ao sin OfWy (8) — a; cos OW, (6)], 
where 


1 


i n= ae 
Wis G Er 8n) 


with e(A) = ie, Then, for ere ee 
Ex.) x] 
+S S 
+00 +00 
= 2/[aÿ COS OS Walt + S)Wri(s) + a; COS DRAG + SW) 


+ aoa; sin DX Wit + Wn ICS) — Wait + s)Wi, (5) = 


of independent 


CREER ENS 
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2 tlens e 2 x 
r 249 aoe ( mee s) ig <a pR Ores, es) + Aaya, sin Ore... es) 


1/2 l - ; l 
-| a oe + e + a A2 (e $e) 
n 2agaı Me™ s g aij dxr 
2 ID. 
z | gag +a hy dhs | EA- (ay — aA Pdh. 


—(1/2) —(1/2) 


We conclude similarly that for k € Z, the process X“!) is stationary with 
spectral density 


(ay — aÀ + KD) Vy -e1-1/2,-k1s/1Q) + (ao + ai — k D liki-i/2ki aÀ) 


for k + 0, and 
(ao + ary La) 


for k = 0. 


B.10.3 Approximation 

Let us take two independent sequences (Uy) and (Vix) ne Z, k (= Z, 
of independent standard normal variables. As a first-order approximation 
of the process X, we consider the process 


+00 +oo 
fo Vi S FD D YOU ng cos klt + V n sin KID. 


—— 00 n=— co 


This process is a sum of independent processes that are of the aes 
in Section B.10.1 and whose spectral density we know. Hence 1 | 
tionary and its spectral density is 


ro(A) = D FD yet: 
k=—00 


ximation 
Similarly, using first-order germs, we gèt the appro 
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+oo +20 


XV > > [Une (I) cos kltyn i(t) +f’ (kD sin king 


k=—co N=—0o 1()) 


+ Ving(f (KD) sin kltyin s(t) — f’(kD cos kin, (t))], 


which is a stationary process with spectral density 


FO) = DF ED + A= RDS DY? tiyoriy). 


wes 


Thus we have constructed stationary processes X’? and X/! whose Spectral 
density is close to that of X. To study the quality of such an approximation 


~ 


we suppose that the process X and the processes X°° and X"! are defined 
on the same probability space, and obtain L?-estimates from which follow 
results in law that make sense here in a simulation framework. We proceeq 


as follows. 
The spectral representation of X is 


X,= [ V2 cos Atf(A)dBS + f V2 sin Atf(A)dBo, 
0 0 


so let us define To and T 1 by 


Foà) = >. f (KDl ikiiki lÀ) 


K=— 00 


and 


FO = SF D+ A= EDF ED) error) 


Les 


The functions fo and f | are even up to a negligible set, so if we put 


y? - i: V2 cos Afo) dB? + ik V2 sin Mf (A) dB 
0 


and 


N 


— 
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y! = f V2 cos Af (À) dB; + | V3 sin Af (A) dBŸ 
0 


n the processes Y° and Y! are Stationary Gaussi 
ý ity ( fo) and (f1)? respectively. So Y? 
aren the same law as X/1. 
Y But we obviously have 


an and centered with 
has the same law as X’? and 


EIX, - Y°}? = 2[ If 0) - Fo ax 
0 


and | 
EIX, - |? = 2| If) FO ax 
0 


B.10.4 Remarks | | 

(1) In practice we have to truncate the infinite sums and approximate 
“1,0 
X; by 


- +K 
XVI FED Y WOT kit + Vay sin kit 
k=-K n:|(t1/2)—ntt|<L 


Then 


K 
EP -XPP= Pan SPan vo 


Ik|>K k=-K n:\t(1/2)—nn|>L 
K +00 
2 1 
< ` FD + > PE Das ma 
|k|>K k=-K m=[L/n] 


Which can be made small by choosing K ane 5 uy pa are 
(2) The processes X!® and X»! are ergodic and a a 1982] 
à i | 
Gaussian with a spectral density [Cornfeld, Fomin, sik igs eee Fa and 
(3) Naturally, we can repeat what we ee = wi 
'St-order germs with pth-order germs as well. 


Yy 
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B.10.5 Remarks 


Nonlinear Representations 
Iterative definitions such as 


X n+1 = F (Xn, Xn-1; mee ,Xn—p> Un) 


| sei i iscrete time i | 
where F is a nonlinear function and (U n) is a disc a Gaussian White 
y stationary processes. uch processes pa 


noise, may give rise to strictl i ; 
as easy to simulate as AR-processes, but their theoretical study is mor 
enerally non-Gaussian and their second-order structure 


delicate (they are g | 
does not define them in law); see for example Guegan [1988]. 


Representations by Wiener Chaos | | 
In some models, it is interesting to extend the particular features of Gaus. 


sian stationary processes to non-Gaussian processes. 

This can be done with the help of the decomposition 1n chaos of the 
L?-space of the Brownian motion which was discovered by Wiener: Le 
Q = &(R,, R) equipped with its Borel o-field B, and let m be the Wiener 
measure on (Q, B). The coordinates (B;);>0 are a standard Brownian motion 
process under m. Every random variable X € L?(Q, B, m) can be written 


uniquely as 


x=) oP tise xt ABH aby. 


m0 VOTI< tn 


where Tn E L°(P;. An), 


Pris tal <= Se E 


and À, is the Lebesgue measure on P,. 
Then by the isometry property of the stochastic integral (e.g., see 


Bouleau and Hirsch [1991]) 


EX? = D'I PA TN 


n=0 


By this isometry, every unitary group of operators on the space 


Dro L’ (Pn, An) (which is called the Fock space) defines a wide sense st 


tionary process. 
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For simplicity, we consider the second chaos. Suppose that Q, i 
ary operators on L*(P2, Az) (we can take, for example Goo. 


of unit 
t)). The process 


af (tis 


X: F | O,f (fi, t2)dB;, dB,, 
O<tj <t? 


is a wide Sense stationary process. 
Jf, moreover, we take for Q, the translation along the diagonal of P;, 


we obtain 
X; = | f (t+ t1,t + to)dB;, dB, 
O<t | <t2 


with f € L?(P2,A2), which is a strictly stationary process. (The process is 
defined here only for t = 0, but that could be altered by a slight change 


of setting.) 


C MARKOV PROCESSES 


This section is devoted to the simulation of Markov chains and Markov 
processes with transition probability functions that are given by an analytic 
expression or can, at least, be simulated. 


We shall deal only with time homogeneous R¢-valued Markov pro- 


cesses for which there exists a version with right-continuous paths. Typi- 
Rf that generates 


cally, if (P;);>0 is the semi-group of linear operators on 
the Markov process, the canonical version will be denoted by 


(62; IF, Pr Xt Or, Py), 


where 


Q = {right-continuous mappings from R, into Rf} 


(coordinate mappings) 


X (@) = w(t) 
F, = 0(X5,5 St) t20 
VEREA 


6,(w)(s) = W(t + 5) (shiftoperators) 5, f= 0 
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arent E ES, 
v = probability measure on (R4, -2 (R°)) (initial distributio 


: n) 
Py = probability measure on (Q,.¥ ) such that Py o Xp! = y 


and 


Ev[f (X1)|Xu, u Sse Peg (Xs) Py — a.s., O<s< Í < oo 


C.1 Jump Processes 

d l 
For sake of brevity, we write (E, & ) for (R, .2(R“)). Consider a Markoy 
kernel P of (E, €) to itself and a nonnegative bounded Borel function À 
on (E, €). Let MA be the operator on B& defined by 


Mf = NW, fe bE 


and consider the semi-group 
P, = e'a P-D) 


where the exponential is understood in 4(b4,b&). This semi-group 
defines a Markov process, which is called a Markov jump process. It 
behaves like a Markov chain that has kernel P, but whose sojourn time at 
x € E is exponentially distributed with parameter A(x). 

More precisely, let (Z,)1>0 be a Markov chain on E with initial distri- 
bution v and transition kernel P. Let (U,,),>; be a Borel sequence in [0, 


1] independent of (Z,),>9 and, for n > 1, put 


] 
Tn(Zn-1) = AZ D) log (Un) if MZn-1) >0 


= +% E 20 


Then, for t > 0, the process 
(*) Ap = Lol reryZo)) + 2 nln | TKZk- VSS Tk(Zk-1)} 
n=] 


is a Markov jump process with semi-group (P,),50. 


C.1.1 Exercise 
Prove the last assertion. 


X;=a and X, =b for s < u < t. (See Figure 3.3.) 
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ear that (*) provides a method for sj 
Ge. or instance, let (Va)nzo be a sequen 
ces ommon distribution v and let (N rs 


ae ater À independent of (V):20. Put 


gris cl 


Then X is a Markov jump process with 


P f(x) = f (x + y) dv(y) 
Mx) =A, VrxeËE. 


C.12 Exercise 
Show that in this case X has independent stationary increments 


C.2 Iterative or Recursive Simulation 


Die Frage bei allem, was du tun willst: “ist es so, dass ich es unzahlige Male 
tun will?” ist das grösste Schwergewicht.2 [F. Nietzsche, Leipzig, 1901]. 


The first method (iterative) consists of a time discretization: we get a 
Markov chain (Xnn)n>o With transition kernel Pp in place of (X;):0. Thus 
we obtain a sequence of points on a trajectory but have no way to get 
information about the behavior of the trajectory between these points. This 
method is useful for studying the asymptotic behavior. 
ie second method (recursive) involves recursive interpolation between 

es s and £ by computing the conditional distribution of X, given 


recursive simulation of a 


The following five graphs are obtained by 
looked at with 


One- 1 + ° . : hs 
i -dimensional Brownian motion. The Brownian path is 
] BYE . . 
NCreasing scale, which shows its fractal structure. 


Th 
Æt ` r , 6 
question which is the main center of gravity about anything that you want to do: ee 


Want t PRE 
© do it infinitely often in this way?” 


Figure 3.3 (Continu 
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Figure 3.3 (Continued) 


C.2.1 Exercise 
Suppose that the transition probability function satisfies P,(x,dy) = 


p(t,x, y)dy. Show that, for 0 < s < u < t, there exists a regular condi- 
tional probability distribution of X, given (Xs = a, X; = b), which is given 


by 


p(u — s,a,x)p(t — u, x, b) | 

m ns ee dl = 

p(t — s, a, b) x if p(t—s,a,b)>0 
if p(t—s,a,b) =0. 


a 


Je pense d’ailleurs que la manière dont, dans des exposés plus récents j’ai défini 
la fonction X(t) du mouvement brownien est plus claire que celle de Wiener. 
Je vais la rappeler brièvement. Supposant pour fixer les idées X(0) = 0, on se 
donne une suite de variables gaussiennes réduites Ca indépendantes les unes des 
autres; chaque €,, sera l’accroissement de X (t) dans l’intervalle (n— 1, n) de sorte 


que 
X(n) = 6; +É2+...+6, 


est bien de la forme €/n, la lettre & désignant toujours une variable gaussienne 
réduite. Il reste à définir la variation de X(t) dans chacun des intervalles (n- |, ”): 
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qi n’y a pour cela qu’à appliquer la formule d’interpolation 
T X(t) + X(t4 Aie 
X(t+ se 
m 5 3 ey 


[...] Par application successive E la formule (*), on détermine X(t) pour des 
valeurs de t multiples impaires de >, puis pour ceux de 4» Puls pour ceux de À et 
insi de suite. Designons par X(t) la fonction égale si t est un multiple de 27P 
valeurs ainsi obtenues et variant linéairement dans chacun des intervalles 
pe éparent ces multiples. On démontre aisément que presque sûrement, pour p 
Pe Xo) tend vers une limite X(t), et cela uniformément dans tout intervalle 
1 , 
fini. [...] 


On arrive d’ailleurs à d’autres définitions constructives de la même fonction 


Jéatoire en choisissant autrement les points de division sur l’axe des t. Il faudrait 
oc utiliser une formule d’interpolation plus générale que (*). 
S 


Jl faut bien comprendre le rôle de cette formule. Il serait facile de définir ee 
nt chacune des fonctions X,(r) indépendamment des autres, les X p(h/2PYh = 
es ) s’obtenant successivement par l’addition de variables indépendantes de 
la forie 27/2) 6. Mais il n’y aurait aucune corrélation entre les différentes 
roximations et on ne pourrait plus parler de convergence vers une limite bien 
re Il n’y aurait que convergence en loi. [Paul Lévy, 1970] 


3 | hermore, the way in which, in more recent expositions, I have defined i tree 
ee t Dra motion is clearer than that of Wiener. I will ea it p pa ae = 
- i iables, indepen 
i f reduced Gaussian variables, 
= 0 and we are given a sequence o l S ‘ 
ae each €, is the amount of X(t) increases in the interval (n — 1, n), so t 
each o , n 


X(n) =i +62+---+őn 


i i t still 
i educed Gaussian variable. We mus 
i n, the letter Ẹ always denoting a r Sonn 
a varies in each of the intervals (n — 1,n). For this, we ne y 
the interpolation formula 


POR ORALE 


i values of t equal to odd multiples 

By successive applications of (*), we determine a F for the function that, when / ke 

a ER i shee $ TAN ae Reel and varies linearly in ee n a 
| . nes i t, as p tends to in , Ap 

intervala thats ar pe multiples. It is seu ae ne interval. ‘ie p 

tends almost En to a limit X (r), and that this 1s eee random function by choosing oe 
ie | tructive definitions apes ore general interpolation form 

nu rae would need only to use à M ula. It would be easy to define 

ao se me the role of this formula. 
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C.2.2 Example ! 
Let (X;);50 be a subordinator with explicit transition Probability d 
P(t, x, y) = g(t, y — x). We give two examples: Ensity 


e I-process, 


l -x 
q(t, x) = rO xi le 9 x > 0; 


e the one-sided stable process of order 1/2, 


. 1 2 
g(t, x) = ae tx? exp l-5} x>0. 
T 


We compute the stopping time 
Ta=inf{t>0:X, 2a},  a>0, 
as follows: 


(a) the number 
N =inf{n>0:Xnm 2a}, 


where / is fixed, is computed by iterative simulation: 


(b) between (N — 1)A and Nh we search by binary division the interval 
where X; crosses the threshold a. Thus T, can be computed for each 
trajectory with a priori fixed accuracy. 


C.2.3 Example 


Let (X})0 and (X2),> be two independent subordinators from the above 
class. The subordinator defined by 


each of the functions X(t) independently from the others, the Xp(h/2?)(h = 1,2,...) being 
given successively by adding independent variables of the form 2 ?/ °€. But there would be 
no correlation between the different approximations, and we could not speak of convergence 
to a well-defined limit. There would only be convergence in law. 


| 
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Y, = X y2» t > 0 


has transition probability density 


DUS yy = fac — x)q2(t,z)dz 


put the use of this formula requires rather lengthy and imprecise quadra- 
tures. It is more convenient to proceed as follows. First, by iterative sim- 
ulation, compute 


N=inf{n>0:Xy. 2a] 


2 2 1 . 
and store Xw- XNp) as well as (Xi: ,X x ). By recursive compu- 
tation (N-1)h Nh 


e 2 . 2 
(a) simulate X(y—iyjscn/2) given Xy-pa and Xÿ5 


weer 1 | 
(b) simulate X > given X!, and XL, 
(N—1)h+(h/2) XN-Dh XNh 


and systematically divide the time interval where Y crosses the level a. In 
this way Ta is computed with a priori fixed accuracy. 


C3 Simulation of a Brownian Motion Until Hitting a Hyperplane 


We now turn to the so-called importance sampling method in particle 
| transport problems. This method has close connections with the theory 
of Doob h-processes. | 

We use the following principle: in order to compute E,F(X,;) = PF) 
for a Markov process (X;) we consider a positive Borel function @ and 
define the semi-group 


1 
t an ). 
O(f)= n (of 


If is (P,)-excessive, this semi-group is sub-Markovian and defines a new 
Markov process Y;. We compute 


F 
P (Fœ) = PQIQ: ( E) (x) 


n 
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by simulating Y,. This method may have the advantage of lowerin 

variance of the random variable whose expectation is computed. ue 
As an illustration, we consider simulating an R°-valued Brown; | 

motion (d > 2) until it hits the hyperplane H = {x4 = 0}. Let CR 


| 
À 
| 
| 
| 
| 
| 


t=inf{t2>0:B, € H}. | 


If A is the Poisson kernel in the half-space, the h-transform of the Brow. 
nian motion yields a new Markov process that has the same distribution 
as the Brownian motion conditioned to converge to O (or any other Point 


in H) at time Tt and then to be killed. 

We now proceed to explain how this can be done. We first need some 
classical results about one-dimensional Brownian motion. These are wel}. 
known, but we give short proofs for the sake of completeness. 


Proposition C.3.1 Let (B;) be a one-dimensional Brownian motion 
starting from 0, and let A > 0. We define 


T, =ini {tf 20°>B;=a}. 


(i) (Reflection principle). Put 


Then (B7) is also a Brownian motion starting from 0. 
(ii) The distribution of T, is given by 


Fe y 
Pare <1)= | ex 1-5 bas t>0. 
: 0 V 2s EL 2 


Proof (i) Clearly B’ starts from 0 and has continuous paths. Intuitively, 
both Br, — a and a — Bur, are Brownian motions independent of the 6- 
field .77,. More rigorously, let 0 < s < £ < œ, A e F; and u be a real 
number. It is easily checked that 


Eolexp {iu(B; ia B‘)} 1 {Tass} la] = Eolexp {iu(B; ae Bs) ra<s) 14] 


and 


y 
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polexp (Br = BS) icra) 14] = Eolexp (in(B, — p itil 
s t<Ta} tA]. 


follows from the optional 
It 


2 
Bo exp f sind + = 7 


and We deduce that 


: . 
ampling theorem for martingales that 


2 
Farr, = exp {tier + zC A ro) 


Eolexp {iu(B; it BS) } l {s<Ta<t} 14] 
= Eolexp {iu(2a — B,)}Eo[exp (UB }|Finr M (ser act) LA] 
= Eolexp {—iuBs} 1 is<r,<t} Eolexp QiuB,}| Fr] 
= Eolexp {iu(B; — B;)}liscr cn lal. 


Summing the three equalities we get 


Eolexp {iu(B° — B°)}14] = Po(A) exp 1 (t — ) 


which proves that B; — BY is independent of JZ, and has distribution 
N(O,t—s). 
(ii) 


lestri hral UT EEBre a 
= {Ta < t,B; 2a} U {Ta <t,B; >a} 
= {B, 2a} U {B; >a}. 


These events are disjoint, so 


Po(Ta <tj= Po(B; 2 a) + Po(B; >a) 


_ 2 [ e707 /20) dx. 
V 2t Ja 


-1/2 to obtain 


Finally, we make a change of variables x = (t'?a)s 
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ft 
Po(Ta <t) -f (21s?) aet ag D 
0 


Proposition C.3.2 Let x > 0, let B, 
motion starting from x, and let 


be a one-dimensional] Brownia 
To = int {£20.28 = 0}. 


Then, for any b > 0, 


b 
PAB,<b,To>0- | M(t, x, y) dy, 
0 
where 
Tt, x,y) = pt, x, y) — p(L, x; —y) 
and 
P(t, x,y) = Qu) exp {-(y — x)?/2r}. 


Proof We have 
P,(B; < b, To > t) = P,(0 < B, < b) - P,(0 < B, < b, To t). 


Put 
B; = B, ie AS To 
==, E 12T 


It follows from the reflection principle that B’ is a Brownian motion 
starting from x and 


P,(B; <b, To > t) = P,(0 <B, <b) - P,(-b <B; <0) 


b 
= Í (p(t, x, y) — p(t, x, —y)) dy. z 
0 


We return to a d-dimensional Brownian motion B. Let E, be the upper 
half-space in R“ and let H be the boundary hyperplane, that is, 


D... 


g 
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E, = {z= (x,y)e€ Rf; xe Rtt 
He (=e Rive Rt y0 


put i 
t=inf{t>0: B, ¢ E,}. 


nterested in the behavior of the Brownian motion killed as it leaves 


We are 1 
ose Zo = (Xo, Yo) € E+ and set 


E,. SUPP 


b, = (BI,...,B¢'), 20. 


ce the exit time T depends only on the last coordinate, it follows from 


Sin 
position C.3.1 that 


pro 


t 2 
P,,(t < t) = eee _ Yo 
of ) j (2rsi)!/2 i | 2s as 


and the conditional probability density of by given T= to is 


LAS 
(to, Zo, 9) = (2nto) E exp {eet ; Be RW. 
0 


These expressions provide an approach to simulating T and Bz: 


(a) first pick t with probability density 


I(t, Zo) = (2n0°) yo exp {-yo/2t}, t >Q; 


(b) once we have computed T= to, pick b; with density g(to, Zo, 9). 

All that remains is to simulate the process B between time 0, when it 
lies at zo, and time fo when it reaches H at Q = (80,9). This can be done 
recursively, for which purpose we must first compute some conditional 


distributions. 


Proposition C.3.3 Let DE ESS SES A regular version of 
conditional probability density of B, given T= to, Br = Qo, Br, = Zir An 


B, = z3 (By in R2 ', zı and 22 in E,) is given by 
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T(S — ti, Z1, Z)T(Í2 — S, Z, Z2) 


zE E,, 
T(t2 — f1, Z1, Z2) 


where 


2 2 
H(t, (x,y), (x’, y) = Cr) 4/2 exp i a] 


CS 2 r 2 
(exp [0 l- = +) | 


Proof Let 91, Y, > be bounded Borel functions on E,,k bea bounded 
Borel function on R°°!, and p be a bounded Borel function on R,, which 
vanishes outside ]f,+c[. From the Markov property of Brownian Motion 
it follows that, provided fı > 0, ' 


Ezo [91 (Br, )W(Bs)P2(Br,)p(WK(br)] 
= Ez [9 (B; )1 {tr} )W(Bs)1 tes) P2(B;, )1 {t>12}P(T)k(bz)] 
= Ez [91 (Br) lir EB, DWBs-1, l s-r) 
Beg, ,, [92(Br-s)1 tor-s} EB, -s [P(t + T)k(b:)]]]] 


=| nit, 20421)1(21) da | T(S — t1, Z1, Z)Ų(z)dz 


+ + 


l T(t2 — S,Z,Z2)P2(Z2) dZ2 
E+ 


oc = taza) dio | oe k(@o)g(to — t2, Z2, 00) d6b, 


t2 R 
while 


Ez0[01 (B1 )P2(Br )p()k(bz)] 
al nti, zo 20 da | M(t2 — t1, Z1, Z2)P2(22) d22 
Ex 


E4 


ow = 2) do] F  KCOo)8 Cto — t2, Z2, Q9)dQo. 
t2 R2- 


The conclusion follows from the definition of the conditional distribu- 


tion. In the case t; = 0, we remove the first left integral; nothing else 
is altered. D 


aan wees 
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the case tz = lo requires a separate computation 


proposition C.3.4 Let Osi, esse to. A regular i , 
tional probability density of B, given T= to, a an Sune Les condi- 
in E,) iS 71] 1\%o In Es 


MS — £1521, 20 — s, z)8(to — s, z, 80) 
a ee 
Ito = t1, z1)8(fo — ti,z1,00) °? ZEE 
Proof Let 91,¥, k, and p be as in the proof of Proposition C.3.3 
except that p now vanishes outside ]s,-+c[ instead of ]t2, +œ[. We check 
that, for tı > 9, 


Ez [91 (Br YBs)p(k(bz)] 


= | altzo zei (nd: f MS — t1,21,z)W(z) dz 
+ Ey 


f =e 2)p(to)ata| 


Rg 


8(to — S, Z, 80)k(60) dOo 
and 
zo [9 (Br, )P(T)k(bz)] 
= [ T1, Zo, Z1)@1(z1)dz: f [to — t1, Z1)P(to) dto 


| 2(to — t1, Z1, 89)k(8o) dBp. 
Rq-1 


This gives the conclusion, which is trivially the same in the case 
ti =0. 

We can now proceed: we pick successively the exit time fo, the exit 
place 65, and some positions for B;,,...,B;, if O < tı < ++: < tn < to. In 
some particular instances, we may only be interested in trajectories that 
hit a given subset A of H. For that purpose, we use a rejection method in 
the choice of B, given T = fo. It is, however, of some interest to compute 
the global distribution of B+, which we now do. 


Proposition C.3.5 Let A be a Borel set in R?~'. Then 


P,(B, € A) -| h(8,z)d8, ze Es, 
A 
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where A is the Poisson kernel of the half-space, that is, 


I(d/2) y 
nd/2 [0 — z|d” 


h(6, z) = Ge RT, zeE,.. 


Proof 
P.(B, € A) si| Í g(t, z, O)i(t,z)dt dO 
0 JA 


-| Í TAr exp (18 21/21) dt do. 
Ado 
A change of variables s = |0 — z|?/2t gives 


° y De (d-1)/2 
P, B À) = —— — s e “ds dð 
(B1 € A) [ [ (27)7/2 I0 z zl4 


and integrating over ]0,-[ gives the required result. 


C.3.6 Remark 


We first draw the hitting time T = fo: in this way the Poisson kernel h 
does not appear in the simulation formulas. In other instances, it is more 
convenient to draw the hitting place on the boundary first and then to 
handle the process conditioned to hit the boundary at this place, which 
is exactly the philosophy of Doob’s h-transforms. Let us give some de- 


tails. For 8 in R°!,z in E,, @ in b6 A(E,)), put 


he(z) = h(6, z), 


Tl, (Zz) | Mt, z, zolz) dz’ = E[l ior], t20, 
E4 


| 
0 et ao = 
D(z) ME Tu (Aop)(z), t20. 


We check that for s,¢ in R4, 


feere 
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Tusz) = E; [p(B,,5)1 {t>r+s} 
= E, [O(B;00,)1 {t>r} L{r06,>5)] 
= EM t) Eg, LQ(Bs)1 p55) ]] 
= Ty Ts (2). | 


This proves that (Tr) 120 is a sub Markovian semi-group of linear operators 
on b( PB(E,)). A straightforward calculation yields the same conclusion 


ER 
it ne nen, for P1,...,n in bo A(E,)) and k a 
pounded Borel function on R}, 


E-L (Bn) + On Bin) l ft>} k(b:)] 
= | Mt, Z, 21) @1(z1) dz}: a | T(t mi bn 27-15 Zn) 


E, E, 


s On(Zn) dz, | h(6, Zn)k(8) d® | 
Rd-1 


1 
= [re z)k(6) dO de s TCti, Z,21)Q1 (21) 


dz 
ae + I. Moy tnis 2ni; Zn)QOn(Zn)he(Zn) dzy. 


+ 


* ho(z1) 


We can deduce that, for almost every 8 in R®', 


E-[9: (B7) me Pa(Bim Litn) |b, = 0] 
= Q? [9:08 E OP 1, nl: 1). 


This equality gives a precise meaning to our statement above: the hg-trans- 
form (0°) ;>0 of the semi-group (m&):0 of the Brownian motion killed at 
the boundary H is the semi-group of the Brownian motion conditioned to 
hit the boundary at place 6 and be immediately killed. 


C.3.7 Remark 

To deal with balls instead of half-spaces we use a Kelvin transformation 
that maps a ball one-to-one onto the half-space F,. In particular, the exit 
time from a ball of a Brownian motion starting from its center has the 
explicit Laplace transform 


pT wz (p/e n" 
i Tarana 2p) 


Eo e 
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where 7 be 


v IS the modified Bessel function 


Ny(z) = > me) 


MIT (v +m — 1)’ des 


m=0 


We refer to Durrett [1984], Kni | , 
> Knight [1981], and i : 
[1962] for further information. Ciesielski ang Taylor 
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d 
Among R -valued Markov processes, the processes with stationary inde 


d concen- 


| Space translations 
(convolution semi-groups); the Brownian motion as a PSII; the increasing 


PSIs or subordinators; the R-valued PSII and the stable PSII; the notion 
of subordination in Bochner sense and its interest for the simulation of 
infinitely divisible laws; and the existence of a realization of all subordi- 
nated processes of a general Markov process. 


D.1 Definition and First Properties 


We adopt the following definition, which extends Definition B.7.4 by 
removing assumptions (i) and (ii) (for a definition with property (iii) below 
weakened, see, for example, Bretagnolle [ 1973]). 


Definition D.1.1 A process with Stationary independent increments 
(PSII) is a process (X;,)>9 with values in R such that 


(i) Vs,t,0 < s < t,X, — Xs is independent of the o-field J; = o(Xw 
uss); 
(ii) the law of X, — Xs depends only on ¢— s; 
(iii) the sample paths of X are almost surely continuous on the right 
with left limits. 


ones rés 
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This implies that X is a homogeneous 
ys) be the law of X, — X,. For u e R ee Process. Indeed, let 


Ble | Fi] = Ble X XX) oy 


= eilusXs) Jetons) = feu, (dy) 
_s(dy). 


Thus if a sub-Markov kernel P, is defined on (R, A(R) by 


(D.1.2) Prf (x) = fs G+y}(dy), Vf € bB(RR*), 
the family (P;)r20 is a semi-group and 
Ege *? | F] = (Pesce )(X,), T Vie RY. 


This relation extends, since linear combinations of exponential functions 
form an algebra generating the Borel o-field, into 


(D.1.3) EU X)iFl=Pisf (Xs),  Vf € dB (R%). 


Moreover, if f is bounded and continuous, the relation 


fr Gœu(dx) = ETF X: — Xo) 


and the right continuity of the paths give 


lim fr (x)p,(dx) =f (0). 


We have the following proposition. 


Proposition D.1.4 A PSII X is a Markov process admitting a transi- 
tion semi-group (P;) given by 


ad 
P,f (x)= fs (x + y)u(dy), yf € b. ACR‘), 
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where y, is the law of X, — Xo. The family of probability Measure 
S (u) 


is a convolution semi-group on R“: 


U; * Us = Ur+s, Vs, tE R, | 
lim Hr = ĉo narrowly. 


Conversely, let X be an R?-valued Markov process whose sample pa 
are continuous on the right and with left limits and which admit a transite’ 
n 


semi-group (Q;) that commutes with translations of space: 


(D.1.5) Olt f)=tOf), Vie bB(R%) Vre Ri, 


where t,f denotes the function y — f(x + y). Then the mapping f < 
PRY > O,f (0) defines a Radon measure, and hence a probability 


measure V, and we have 


Of (x) = Í f &+ydy). 


It follows that v; * V; = Vas and 


ELf (X; - XF] = ELf (X; —X,)] = Í EE 


Therefore X is a PSII in the sense of Definition D.1.1. 


D.2 Convolution Semi-groups on RÍ 

The families of probability measures on RZ, which are a semi-group under 
the convolution product and are narrowly continuous (see Proposition 
D.1.4), have been characterized by Lévy and Hincin and are related to sev- 
eral analytical and probabilistic properties. We shall quote only the main 
results and refer to Feller [1966, Chapters IX and XVII] and to Berg and 


Forst [1975] for a more detailed exposition. 


D.2.1 Negative Definite Functions 
If X is a PSII, the characteristic function of X, — Xo is of the form 


(D.2.2) EfeiteXr-X0)] = 7w Vre R.. 


rene 
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obtained in thi 
he functions y S way are th : | 
„hich are continuous and vanish at zero. They =. definite functions, 
n 
(p.23) > IVG) + WG) — y(u; — uj); > 0 


i j=l 


(tis Un) € RVG... 6) € Cr. 


p.24 Lévy-Hincin Formula 
The continuous negative definite functions vanishin 


: at zero can = 
resented in the form j an be rep 


(D.2.5) yu) = il, u) + (Zu, u) 


— EX Yu); 


d . . è Dia . 
where u € Ra Z is a semi-definite positive matrix, and v is a positive 
measure on R° such that v{0} = 0 and 


| 1 À |y|Pdv(y) < +o. 
Rd 


The measure v is not in general o-finite on Rf, but only on R\{0}. It 
is uniquely determined by the knowledge of y. It also possesses a prob- 
abilistic interpretation with respect to the sample paths of the process as 


follows. 
If A is a Borel set in RA\{0}, tv(A) is the expectation of the number of 


jumps on ]0, t] whose amplitude is in A: 


NAE Ñ, Xs- Xs). 


O<s St 


If formula (D.2.5) is changed by replacing the expression between the 


brackets by 


| Iyl | 
Ds i 
[ =g y,l + i(y,u) 1+lyl 


A 
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by | 


alid 


o be changed; the formula is : 
asure v is called the Lévy Meas, 


only the coefficient p has t 
the same measure V. The me 


PSII X. 


D.3 Convolution Semi-groups On R+ 
y measures on R+ which are a semi-groy 
d are narrowly continuous, can be stuq 
f the characteristic functions, 


The families of probabilit 
the convolution product an 
the Laplace transform instead O 


-P Unde, 
“a Using 


D.3.1 Bernstein F unctions 


If X is a real PSII with increasing sample paths (subordinator), then 


(D.3.2) pete] = gi), Ype R4. 


in this way are the Bernstein functions vanish 


able properties: Ing 


The functions @ obtained 
at zero. These functions have remark 


e They are the functions f that are C® from ]0,°°[ to R such tha 


f 20,f (0) = 0 and (—1)"(a"f /dx") < 0, Vn 2 1. 

e They form a cone and are concave. 

e They can be extended analytically to holomorphic functions on the 
half-space Re z > 0. 

e If (n) is a sequence of Bernstein functions that converges pointwise, 
the limit is Bernstein. 

e They admit the integral representation 


(D.3.3) (x) = bx + [a -e 7) d\( y), 
0 


M n € R, and v is a positive measure on Rž such that k (yal) 
y) < +. The number b and the measure v are uniquely determined 


by @. The m : , ! 
with @ by D32. V is the Lévy measure of the subordinator associated 


e The ope ‘ | 
oe : L | - on negative definite functions: if w is continuous "n°8 
vanishing at zero and @ is Bernstein vanishing at Ze! 


then the fun +t; 7 : 
Zero (see ea a eTA negative definite vanishing 4! 
-) and D.7 below) 


re With | 
Of the | 
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, The cone of Bernstein functions Vanishin 


EE . © at . 
under composition: if @; and @) are B 8 at zero forms a monoid 
€mstein vanishi 
is Qi © P2- ng at zero, so 


DA Examples 


low some families of PSII iti 
give be whose transition semi-sro 
explicit forms. These PSII therefore can be simulated, either ete ne 
recursively (see Section C.2 above). With the help of the ann in 


the Bochner sense, they enable us to generate other simulatable families 
(see Section D.7 below). 


p.4.1 Brownian Motion in Ri 
The negative definite function 


d 1 
ue R? > 35 \ul? 
corresponds to the heat semi-group given by 


1 


PAGE PIE jar PAT. 


for f € (LABRI) (see Section C.3 above). 


D.4.2 Cauchy Process in Ri 
The function u — |u| is negative definite and corresponds to the Cauchy 
semi-group given by 


d+1 l 
P;f (x) = r( 21) | t({m(\y|? + 12) |a+))/2 fa +y) dy 


(This is the heat semi-group subordinated by the stable subordinator of 
order 5; see Section D.7.) 


D43 T-Process . 
The convolution semi-group (ur) on R+ given by 
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= en 
xe * dx 


l 
(dx) = 10,1) ro 


defines a subordinator with Bernstein function 
pp) = log (1 +p), pe R 


and Lévy measure 
e 
v(dy) = 10,107) a dy. 


D.4.4 Poisson Process on Ry 
The convolution semi-group (u,) on R+ given by 


CO 


Pr 
U = e mon 


n=0 


which correspond to the Lévy measure v(dy) = ða, generates the Poisso 
n 


process on R4. 


D.4.5 Jump PSIIs 
Let U1, U2,...,Un,... be a sequence of i.i.d. R¢-valued random variables 
with common law li, and let (W,) be a Poisson process on R that vanishes 


at zero, has intensity A € RŽ, and is independent from the sequence (U,). 


Nt 
X,=Xo+ D Un, 
n=1 


with the convention Ta -= 0. This defines a PSII with transition semi 


group P, given by 


Let 


P, — e™P-D 
where / is the identity, P is given by 


Pf(x)= f (x + y) du(y), 


aa 
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ntial is taken in the sense of b 
the expone of bounded linear operator 
sae (R“ ) into itself. p s from 
; che associated negative definite function is given by 


nm a 


eV = Bel XeX0) — exp if (eux) _ Dayco] 
Rd 


we see that the Lévy measure of X is the bounded measure /.U(dx). 
We postpone to Section D.6 below the presentation of stable PSII. 
pefore doing this we shall give a construction of a subordinator starting 


from its Lévy measure as an introduction to a probabilistic study of PSIs. 


D.5 Subordinators 
Let be a Bernstein function with integral representation (see formula 
(D.3.3)) 


o(x) = | (1 -e)dv(y), 
0 


where v is a positive measure on R“ with respect to which y > y A 1 is 
integrable. We wish to show the existence of an increasing PSII X such 
that ee 

Ee Pai Xo) — e (PP) p20. 


(a) Suppose first that the total mass Iivi] is finite. As in Section D.4.5, if 
Ui,...,Uņn,... is an i.i.d. sequence with law v/||v|| and (N+) is a Poisson 
process on R, with intensity ||v||, independent of the sequence (U,), then 
the process 


Nt 
x = Xo+ D Un 


n=\ 


iS convenient, 

(b) Let (Y!) and (Y?) be two independent PSIIs of this type, with 
bounded Lévy measures v! and v2 on R,. It is easily seen that Y, = y}+Y; 
IS a PSII and that the Laplace transform D,(p) = Fe-PXr-X0) satisfies (with 


he obvious notation) 
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®,(p) = ©} (p)? (p) 
= exp {- avi fa -e95 


and so its Lévy measure is Vi + V2. 
Like Y' and Y?, the process Y has only finitely many jumps on 
every 


finite interval. We quote the fact that if 


s 7 malfa — e Pry TP] 


lva] 


and 


are the relations of the constructions of Y' and Y°, and the measures y! 
and v? are carried by disjoint Borel sets A! and A? in R,\{0}, then 
e Y! — Y} is the sum of the jumps of Y on ]0,f] whose amplitude is 
in A! 
e y = 
in A’. 
(c) Now let v be a given positive measure on R° integrating x > xl, 
not necessarily bounded. We define bounded measures v™ as follows: 


Y2 is the sum of the jumps of Y on JO,t] whose amplitude is 


Yn = 1 v (dx) = Lyi /n+1),1/n}-V (dx) 


and for n = 0, 
V(X) = lV (dx). 


We consider a sequence Y“”, n > 0, of processes of the type above, asso- 


ciated with the measures v”, 


Put 
n 
n) _ k) 
Ke > y. 
k=0 


tetanic 


| 
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A: : (n), . 
an argument similar to that in (b), (Xf?) is a PSII with Laplace trans- 


BY 
form 


DM) = Ber” KP 


FAR al (1 — ePx) av] 
]1/(n+1),c°f 


ror fixed t, since [x A 1) dv(x) is finite, we have by the Lebesgue 
dominated convergence theorem that 


D:D) nto Pr(p) = exp | 


10,0 


(l1-e?*) avon] : 
As the sequence X ” is increasing, 


| limX;” = > y™ a.s. 
n=0 


and so the formula 


nr 
k=0 


defines a random variable with Laplace transform ®,(p). . 

To show that (X,) is a PSII, we must show that its aie ents are inde- 
pendent, which follows easily from the fact that the (Y; ) are independent 
PSIIs, and we must show that (X,) has sample paths continuous on the 
right with left limits, which follows from 


(i) (X,) has increasing sample paths since it is the limit of increasing 
processes, 
(ii) by the dominated convergence theorem, 


208 
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a= pA Limr 
. k=0 k=0 $S“ 
B k) _ 
= > rex, 
k=0 
D.5.1 Remark 
» the process (y 


Since the measures v™ are carried by disjoint sets 
resents the sum of the jumps of X,, 5 < t, with amplitude in ]1 /(k l rep, 
(resp., in ]1, [| for k = 0). The average number of these jumps ane 

Il 


is therefore 


1 
IV |] =v I |) (VOL cf) if k=0). 


It follows that the expectation of the total number of jumps of X on [0 1] 


1S 
vared (ls 1) = [İvi 
= k+1 k 


Thus we see that if ||v|| = +, the expectation of the number of jumps on 
[0,1] is infinite. It can, moreover, be shown that there are infinitely many 


jumps a.s. on every nonempty open interval. 


D.5.2 Example 
One sided stable process of order L By the equality (take the derivative 


of both sides) 


fa-e Thi) Eve. p> 0, 
i he 2 


3/2 


(\/1/2)./p is the Bernstein function associated with the measure V = 


loadan. F ai 
Let X be the subordinator that corresponds to v by the form 


Ee PX:-X0) = e" VT/2VP. The law w of X, — Xo is 


“a 
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t 


~ (Tt? /16x) | 
43372 © j V4 ,c0( (x) dx. 


H(dx) = 


g Stable PSIIS 
. des PSII X is stable if Vc > 0,1a > 0,3b e 


dy d 
R- has the same la R? such that the 
process (CA: 7 b ictl eae process (Xar + bi). If we can take 
_ 0,X is said to be strictly stable. By the Lév 


y—Hincin formula it can 


— AO 
pe shown that we must have a = c® for à €]0,2]. The number œ is the 


order of the stable PSII X. 


p.6.1 Stable Laws on R 


in the real case, the negative definite function y of the Lévy—Hincin for- 
mula must have the form 


5 piy = ipu + cļ|u|®% [1 — iB sign (u) tan os 
(D.0. forae ]0,2];a41; c>0; Bp’ e R, |BÌ<1. 
y(u) = ipu + clul |i — if sign Ge log |u| 
(D.6.3) | T 
fora = 1,c > 0; B, w’ e R,|B|<1. 


Among these laws, the strictly stable ones are those with 


=D in the case & € JO, 2],a # 1, 
e B=0 in the case o = 1. 


In particular, the symmetric stable laws on R are obtained with y(u) = 
cluj% a €]0, 2]. | 
By (D.6.2) and (D.6.3) we can express the Lévy measure v by: 


a 1-a Il 
COS 


—- 


1 +6: . 1-f | 
. EE P l 0, +o (X) + > 1} 9,04 (x ] ax 


and, for a= 


Y 


SIMULATION OF RANDOM 
"PR 


210 
OPES gp 
211148 1-8 : 
(D.6.5) v(dx)=c er) | > 1}0,+00[ x) + 72. tearc) dx 


If B = 1, the Lévy measure is carried by RY. These stable proces 
called completely dissymmetric. If, in addition, 0 < @ < 1 and ws oe ate 
the process X is increasing and is a subordinator such that > then 


(D.6.6) Efe P%-X0)] = "HP exp | — TO 


D.6.7 Stable Laws on R? 
The stable laws of order 2 are Gaussian. For & € ]0, 2[, the negative defi- 


nite function of the Lévy—Hincin formula must be of the form 


y(u) = ilu, u) +f Wl — ey) 4 i( y, u)L{jy|< }] dV( y), 
R 


where the Lévy measure v is a product measure v(dr, d®) = n(dr)m(d6), 
where m is a probability measure on the sphere S¢-! and nis a measure 


of the form k(dr/r'*%) on ]0, 1. 
We obtain the expressions 


vu = iuu +e] 1@* 


(D.6.8) | [ 1 — isign ((6, u)) tan =| m(d8) 
forue R4,c>0,a€ ]0,2[,a#1. 


via) = isa) +e] (eu) 
(D.6.9) | 1 +isign((@, DE log |(0, 2] m(d@) 


foru € R! a = Le > 0. 


The constant k of the Lévy measure is given by 
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œ fae 
kage ee eee 
“TQ eae toa for a#1 


2 
ns for Q= |; 


_ 0 and the measure m is Symmetric with Tespect to the origin, we 
ptain the stable symmetric laws on R4 with | 
0 


(D.6.10) yu) = c | (6, u)|“m(d®), © € 10, 21. 


sd-1 


D.6.11 Remarks 


(1) If a # 2, the stable symmetric laws on R constitute a family that 
depends on a general symmetric measure on S% !; in the case a = 2 they 
depend only on a symmetric matrix d x d. 

(2) Formulas (D.6.8) and (D.6.9) show that the stable PSIIs are sums 
of completely dissymmetric one-dimensional stable PSIIs that are carried 
by straight lines drawn from the origin and are independent. These sums 
are understood as suitable integrals. 


For the exact simulation of stable laws, see Chambers et al. [1976], 
Devroye [1986], and Boulanger [1990]. 

It is also possible to simulate a stable law approximately by truncating 
its Levy measure. For the processes, that means to approximate a PSII by 
a jump PSII. There exist also approximate simulations based on expansion 
by series. We quote these because of their theoretical and pratical interest; 
see Lepage et al. [1981] and Marcus-Pisier [1983]. 


Proposition D.6.12 Let X be a symmetric stable real random variable 
of order &,0 < a < 2, with 


Ee/"* _ on clul™ 


Let E\,...,E,,... be independent exponential variables with parameter 1 
and let T,=E, ++--+E,. 


Let V be a symmetric random variable such that 


sin f 


a E(\V|1=¢ | a a! 
0 t 


and let V,,... V,,... be independent copies of V, which are independent 
of the E;. es 
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The series | | 


(*x*) = ~(1/a) 
> 23) V; 
— j j 


converges almost surely to a random variable with the same law as x 


lation. The cons 
pter XVII 6.4]) 


| The formula (**) is especially easy to use for simu 
in (*) is given by t 


he relation (see Feller [1966, Cha ant 


cos Po 
sin ¢ D. | 
j à dt =T(2- a) a GE eae). | 


The law of V can be taken to be uniform 
The law of large numbers shows that 
(**) converges quickly when o is small. 
A similar result applies to one-sided stable laws. 


on [—a, a] or discrete 35-0418, 
I; ~j as j T œ, and so the Series 


Proposition D.6.13 Let X be a random variable 


with one-sided stable 
law of order & € JO, 1[ such that 


Ee = e27., A>; 


Leila: beas in the preceding proposition, and let V;,..., Ving sus | 
be constructed as before from a variable V that is positive and satisfies | 


C 
Zya = ——__. 
SL rd — ©) 


Then the series 


co 


Dy, | 
jal 


converges a.s. to a random variable with the same law as X. 


These representations can be extended to R?-valued variables and even 
to processes, see Marcus-Pisier [1983]. 


| d l 
gS WITH STATIONARY INDEPENDENT INCREMENTS 213 | 
proce 
appli cation of the Subordination in Bochner Sense to 
P pulation 
i Principle of the Subordination in Bochner Sense 
7. Jo be a Markov semi-group on the measurable space (E, #). Let 


Let ( A A convolution semi-group on R,, that is, a family of probability 
(M) < on R, satisfying 
m 


He * Hs = Hrs s,1>0 


i Hi = 8 narrowly. 


have already seen that with such a convolution semi-group we can 
ee a subordinator (Ÿ;):> that is unique in law (assuming Yo = 0), 
i a Bernstein function @ vanishing at zero, by the formula 
an 


Ee P#r-Ÿo) — | e Pu (dy) = e FPP). 
(D.7.2) ’ | 
Let us consider the operator 


73) Q, = f Pyu(dy). 


Q, is a positive operator from b& into itself, and O11 Bult (Formula (D.7.3) 
| fs taken here in the elementary sense that for x in E and f in b& 


Of &) = f P f uld). 
0 


This supposes only that t — Pf (x) be measurable.) 
Moreover, 


j = +Z (dy)Us(dz) 
a ae x a» | P,u.,(dz) [ Prec! 
0.0 f ua) | 
= [ru * Us (dx) 
0 


= [rats = Os: 


0 


Thus Q, is a Markov semi-group on (E, 6 ). 
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Let us give an intuitive explanation of how the gener 


be obtained from that of (P,). If we write formally P ator of (Q, 


r= era then ) Can 
Q: = f e A”, (dy) 


That is, by analogy with formula (D.7.2), 
(D.7.4) Q, = e HPA), 


In other words, the generator of (Q+) is —@(—A). Such a formula 

argument leading to it, can be made rigorous in some frameworks re y 
tional analysis. This holds when (P;) is a symmetric semi-group st m 
continuous on L7(F, &,m), where m is a positive G-finite measu p 
Hille-Phillips [1957] and Fukushima [1980]. Da 


D.7.5 Probabilistic Interpretation 


Suppose that (P;) is the transition semi-group of a Markov process (X,) 
such that, if p is the initial law, we have i 


(D.7.6) VO< stesa E A T e bE 
Ef[foXo)f1 Xi) * - fm X sm)] 


= foara Ps, (X0, dx1)f 1x1) 


_ Í Pry ty; Et» dXm) fmm). 


We consider a realization (Ÿ;);>0 of the subordinator associated with the 
convolution semi-group (H;)s>9 such that Yo = 0 and the process (Y;):0 is 
independent of the process (X;)>0. Then we have the following result. 


Proposition D.7.7 The process Z; = Xy,,t = 0, is a realization of the 
semi-group Q;: VO < tı < ++- < tn, V80» ---»8n € bE 


E[go(Zo)gi(Z1,) +++ &n(Zr,)] = Í p(dz0)g0(Zo) | On (Zo, d21)g1(21) 


| [os (Zn-1, dzn)gn(zn): 


np 
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proof 


}20» as ili 
ae (Xe and realizing (Y;),9. Now, by (D.7.6) ty spaces real- 


Elgo(Z0)81 Zn) j ‘8n(Zr,)] =E forrest fy (x dx;)gi(x ) 
ty ’ 1 


is | LE CE (Xn, dxs)ente) : 
since, for every bounded measurable F, 
EF; Yt, = Yro i) Y;, ~~ Yanl 


= [Fon tee > Yn )Me, (Ay) Mp1, (dy) vr Urti dYa). 


we obtain 


E[go(Zo)g1(Zr,) ** + 8n(Zt,)] = [parso fon (xo, dx1)g1 (x1) 
fo. ims den Bun). U 


D.7.8 Case of PSIIs 

If (P;)>9 is the transition semi-group of a PSII (X,)0 on R, then it is 
easily seen that Z; = Xy, is a process with stationary independent incre- 
ments and sample paths that are continuous on the right with left limits. 


Thus it is a PSII. 
Let y be the negative definite function associated with X, given by 


EeitXr-X0) = eV), 


If we compute the function associated with Z, we get 


peitwZr-Zo) sF oi XY 7X0) 
2 Bet 
= ety) 

ith (Y;)r20 (which shows that 


Where 4 is the Bernstein function associated with | non). 
“stein functions operate on continuous negative 
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D.7.9 Example 
The symmetric stable process of order B = 2 € ]0, 2] on Ra ; b 
i b 


. : : tai 
subordination of the Brownian motion by the one-sided stable suborg ; 
Mat 


of order œ e 0, 1]. à Š | Or 
Indeed, if EeP” = e~? , o(p) = p“, then with y(u) = slul? we 
Set 


1 
po wu) = ll. 
ible Law 


D.7.10 Application to the Simulation of Infinitely Divis 
iVisible laws 


There are several methods for simulating infinitely q 


[Devroye, 1986]: 
(i) the rejection method when there is an explicit density; 
(ii) the rejection method when the density is only known by approxi. 
mations from above and below [Devroye, 1986; Bouleau (2)}. 
(iii) methods related to particular forms of laws (Gaussian laws, 
Poisson law, etc.). 

Methods (i), (ii), and (iii) allow us to simulate a set of infinitely diyis. 
ible laws. We can increase this set significantly using subordination, Let 
us suppose that we can simulate an infinitely divisible law Hs on Ri 
depending ‘on the parameter ft in such a way that (Ur) is a convolution 


semi-group. In other words, if (X,) is the PSII vanishing at zero associ. 
ble X, for 


ated with the semi-group (|), we can simulate the random varia 
each fixed ¢. If (Y,) is a subordinator such that we can simulate Y,, then 
clearly we can simulate Xy, (by beginning with Y, and then simulating 
X, for t= Y}). 

Thus, if we can simulate a marginal law of a PSII with negative defi- 
nite function y and we can simulate the subordinator with Bernstein func- 
tion @, then one can simulate the marginal laws of the PSII with negative 


definite function @ o y. 
Examples of R¢-valued PSIIs are: 


e A centered Brownian motion with instantaneous covariance matrix 
Z. By subordinating by a one-sided stable subordinator of cage 
we get a symmetric stable process on R? such that the measure M 0 


formula (D.6.1 0) satisfies 


(2, 0)" = ¢ ie 10, u)| md). 


Bt 


+H STATIONARY INDEPE 
cesses WITH STAT ÉPENDENT INCREMENTS 
RC 
À jump PSII (cf. Section D.4.5), 


P 


e 

. An 
can b 
, A sum or a series of independent processes of the above types. 


gxamples of subordinators are: 


+ unes that is a jump process. This includes all Bernstein 
functions of the form 


i (p) = [a -e ™)dy(y) with [|v| < +o. 


We quote some forms of Bernstein functions reached in this way: 


Le" Ps 


P1,1(P) = I aena’ ae]0,1[, a>0 


obtained when v is a geometric law on aN’; 


@12(p) = 1 - exp [0e - 0], @>0, a>0 


obtained when v is a Poisson law; 


(L= 07e? 


O- oey’ a > 0, œel]l0,il 


1,30) = 
obtained when v = XZ ,(1—a)@a(a+1)- + [(atn—2)/(n- 11 a" nr; 


P1 4(p) = a,b>0 


> 


wee 


obtained when v(dy) = a/b? eI!) dy; 


tagem 
pis) af b- p L a,b>0 


obtained when v(dy) = aLio,p1() d); 
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ot ae n! fı = ad | (bp)" "i (bp)™~: 
n! i la || 


obtained when v(dy) = 1 (0,0) y).y"dy, ne N. 

e A subordinator whose Lévy measure is an average of infin; 
Dirac masses (since then the law of Y, is also an oo any 
masses whose coefficients can be computed by evaluaties of Dirac 
lution product of the involved Poisson laws). In this w 8 the Convo. 
the Bernstein functions of the form SAFA Obtain 


Pp) = D a;(1 - eP), p20, ab;>0, 
i 


e A one-sided stable subordinator of order a. We get the Berngte; 
ein 


functions 


(p) = ap”, a>0, ae€]0,1]. 


e A I-process (cf. Section D.4.3). We get the Bernstein functions 


Q4(p) = alog (1 + bp), a,b >Q. 
e Finally, we can take a sum of independent subordinators of the pre- 


ceding types and, of course, any process obtained from the preceding 
subordinators by subordination. We obtain in this way the Bernstein 


functions of the form 


Qi, O Pin 0-0 Os 


where the @;, are in the families above. 


D.7.11 Application to the Simulation of PSIIs 
Subordination in the Bochner sense allows us to simulate iteratively (resp., 
recursively) the PSII Z, = Xy, as long as we can simulate (Y,) and we can 
simulate iteratively (resp., recursively) the PSII (X,). 

Let us consider the case of recursive simulation. 

Suppose that Xo = 0 and X; has a density f,(x) on R? (see Section C.3) 
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tonal law of X, giv + 
condition u given X, =a and er 
| Wigh e Re by X: =b is given, fors<u<} 
| gsm) = PSE =O Fr-ulb~ x) 
Ff t-s(b — a) : 
we use the following recursive algorithm: 
Choose a point y in the interval [o,B]. Simulate Y, oi 
The point Yy obtained is between Ya and Yg: y given Ya and Yg. 
Ya <Yy< Ypg 
Then simulate Xy, given Xy, and X Yg using the formula above. This 


gives 8YaYyYp' 


Thus, we can refine a simulated path as in classical recursive simulation 


p.8 The Bernstein Monoid 


We have seen some interesting consequences for simulation of the subor- 
dination in the Bochner sense. These are mainly due to the fact that the 
cone of Bernstein functions is stable by composition and is therefore a 
monoid. | 

To simplify the details, we consider the cone Bo, of Bernstein func- 
tions vanishing at zero and taking the value 1 at 1, which we call here the 
Bernstein monoid. The relation 


f<ge the Boi: g=foh 


is a partial order on Bo. A branch is a maximal totally ordered subset of 
Bo... A branch that starts at the root (the identity function) and reaches @, 
splits at @ into an uncountable infinity of sub-branches, which form a set 


isomorphic to the whole of Bo. 
By, is therefore a self-similar or fractal object, which is particularly 
s in mathematics. 


beautiful because of the natural place it take 
On the Bernstein monoid the topologies of pointwise convergence, com- 
| Pact convergence, and compact convergence of the functions and of their 
| derivatives are equivalent. A branch is isomorphic under the order to a 
i of R,. This subset can be an interval [0,a[ or a set with isolated 
Oints. 


nement emmener 


is SIMULATION OF RANDOM | 
PR 


D.8.1 Example of Branches “Si | 
(a) The stable branch. For œ €]0, 1] put fa(p) = p*, pe k | 
{fa à € JO, 1]} is a branch. t They | 
(b) The homographic branch. For a € R, put h,(p) = p(l +a)/( | 
ha is Bernstein because hg = (1 + (1/a))k o g, with k(p) = - , 
and ga(p) = log (1 + ap), and we have already seen th Ms 
are Bernstein. 
(C) Let Pis: @pyse si DE Bernstein functions whose Lévy measy 
have compact supports in Ry (e.8., Ea@p) = 1 — ey ang : | 


that JJ, ©,(1) = 0. Then {01,91 © Q2,.-.,91 0 pr... 6 6 | 
is a branch [Chateau, 1990]. Drees] 


at k and , 
q 


D.8.2 Field of All the Subordinated Processes of a Markov Process 
Let f € Bo. We write (Y~ ):50 for the subordinator associated With the 
Bernstein function f such that Y = 0. Let us define a Markov kernel p 
from (R,,.A(R,)) into itself by putting f 


Pru(x) = E[u(Y{)] for ue dAXA(R,). 
In particular taking u(x) = eP*, we have 
Pp (eP x) = eX), 
The kernels Py satisfy | 
PEP gS Pr Vf,g€e Bon. 


By using the family (Py )reg,,, we can [Bouleau and Chateau, 1989; 
Chateau, 1990] construct on a single probability space a family of random 
variables Y;, such that | 


e for fixed f € Bo1,t — YF, is a subordinator with Bernstein function 
f; 

e for fixed te IR,,f — Yp, is a process which is Markovian along the 
branches with transition semi-group Pr. 


More generally, if (X;)/29 is a Markov process and (Y ft) is the family 
chosen above, which is independent of (X reo, then the family Xy/. ji 
a realization of all the subordinated processes of X, which is Markovis" | 
along the branches of B61. For example, by studying the homograph | 
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‘+ is possible to derive formulas 
sh, it IS 


of Pitman—Yor On some e 
pranc a ingales [Pitman and Yor, 1982]. The detailed 
i m 


tja oi oid and its probabilistic interpretations is still 
ten 


Xponen- 
Study of the Berm. 


in its Carly stages. 


POINT PROCESSES 


rather applied section we build up some 
this 1 


models involvin 
puted objects and compute some related 
tribu 


quantities. 


g randomly 
Jn 
dis 


1 sine Out the Problem 
E. 


sider a straight girder composed of a hetero 
We con 


geneous material whose 
on S is either disc-shaped or T-shaped. Let E(x, y) be a function that 
section » 


rizes the material: E(x, y) = E; if (x, y) lies ina grain, E(x, y) = E> 
ae in the binder. Our aim is to compute 
if (% 


| E(x, yy — y) dx dy 
Pi pa ee 


| | @-voaxas 
S ; 


j ion if it 
f gravity of the section 
= ,Yc) would be the center of gr: : 
eom o Since the number, size, and location of = ae = 
ae ea A random (see Figure 3.4). We want to compute 
random, 
Var(E). 


E.1.1 Remark 


E.2 Setting Up the Models 


dge of the 
ot cut off by the e 
ins are spherical, are not to the planar 
We oo P oué We restrict our study 
girder, and do 


problem. 


Les fects 
(a) Parametrization of the ee gt er 
This is the first step in ase space is the do nee by the 
. 16 se sti ; a orain 
gk a point penn on, the location of ag 

or the disc-shape : 


AE dm 
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rides 


Figure 3.4 


polar coordinates of its center and radius. The parameter space is 
F={(p,9,a):pe [0,R],8 e [0,2n[,a<R-—p)aM}, 


where M is an upper bound for the grain radius. 
For the T-shaped section, the parameter space is 


F= {(x, y, a) : B(x, y, a) cS,asM}, 


where B(x, y, a) is the disc with center (x, y) and radius a. 
We shall deal only with the parameter system (x, y, a). 


wie 
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istribution of the Point Process 
( ) F be the set of words over F: 


Let 
F* = {9} D 
n=1 


F be the Borel o-field of the metric space F. va 
et, À er p Then F* is endowed 


F*={ACF : AN F" e FE Yn>1} 


Let P be an F*-valued random variable with symmetric distribution v 

that is, for every n 2 2, the restricted measure V, = V|pn is ‘variant 
under coordinate permutations. Then P determines a random distribution 
of points in F (of grains in our section). The probability of having n points 
is [lv || = Val n), and in that case we observe a sample of n points with 
distribution IVall Vn. 


(c) Connection with Point Processes 

We denote by Mp ry) the set of finite sums of Dirac measures on F. 
This set is endowed with the o-field .44(F) generated by the mappings 
m ~> mA), A € F. Let Or : F* > Mpp (F) be defined by 


Or (DD = 0, n=0 
Pr En. En) = D Be; n=l. 
i=1 


Then N = @poP is an Mpf (F)-valued random variable, and this is the usual 
presentation of a finite point process. Intuitively, probabilities on MP;(F) 
and symmetric probabilities on F * are in bijective correspondance. We 
refer to Neveu [1977] for a rigorous proof of the following result. 


Lemma E.2.1 With every probability measure © on (Mp; (F), VAID), 
we can associate a unique symmetric probability measure V on (F*,.F*) 
such that © = vo Op - 

It is now clear that the distributio 
the generating functional 


n v of P is completely determined by 


yn(f ) = Elexp {-N(f)}1 


Where f : F — R, is a Borel function and N(f ) = |p fan: In fact we have 


SS Seas amine th ese 


y 
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yn(f ) = >I. exp f- 5 renl 4 a AE (cE = 
n=1 m=1 


(d) Nonoverlapping Condition 
Let Do = {Ø}, Z; = F, and, for n 2 2, let Dry be the subset of Fr def 
Neq 
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by 
D z {((x1, Y1, 41), oes , ns Yns an)) a (xi = xj) 


+ (yj — y} 2 (ai +aj)’, 1 Si <j <n}. 


The grains do not overlap iff Vn is carried by Z, for all n > Q. 


(e) Contact Condition 
For n > 2, let OD, be the boundary of Y, in the metric space Fr. It 


Vn(0Y,) = 0 for every n 2 2, then a.s. the grains do not touch each other 
If v,(0Z,) > 0, several contact configurations may appear, depending 


on the part of the boundary that has positive measure. For instance, jf 
V:(0Z3) > 0, we split up OW; into ((F3)1,(0%3)2, and (0Z3)3, where 


(0Y3); is the subset of 03 with exactly j equalities among the three 
inequalities in the definition of %3. If V3((0%3);) > 0, there is a positive 


probability that the three grains have exactly j contact points. 


E.3 Examples of Probability Distributions 


(a) Poisson Point Processes 
We state and comment on some classical facts about Poisson point pro- 


cesses, and again refer to Neveu [1977] for more details. Let (X,.2") be 
a measurable space such that {x} € Æ for all x in X, and let pi be a finite 


positive measure on (X,.%’). 


Definition E.3.1 A Poisson point process with intensity measure | 
is a measurable mapping N from some probability space (Q,.4, P) into 


(Mp (X),-46(X)) such that 


e for any A € Æ, the random variable N(A) has Poisson distribution 


with mean L(A), 
e if (A;) is a finite sequence of pairwise disjoint elements of Z, then 


the random variables N(A;) are independent. 


We state an existence and uniqueness result. 


d | 


POINT PROCESSES | 
osition E.3.2 For any finite Positive measure u on (X,.2) th À 
poisson point process N with intensity me , 45°) there 


| asure u. The distributi 
uniquely determined and its generating functional is an 


Wn(f ) = exp |- | d- eS) TO 
X 


prop 
0 


fN is 


sketch of the Proof. We put 
(Q, £, P) = N, PN), m) x (X, Z, w)", 


where m is a Poisson distribution with mean @ = ||u]| and uo = 0~'u. Let 


VAT ,Zn,---) be the coordinate mappings on Q. It is easily checked 
shat the random variable 


N(@) = 0 if Y(w=0 
Y(@) 


: SY 82:0 if Y(œ@)21 
i=l 
has the required properties. If 
H = {me Mp,(X): m(Aj) =.m,...,m(Ax) = nx} 


for k >1,Ay,...,Ag in % and m,...,nx in N, then 
PoN '(A)= P(N(A:) = m,- ., N(Ax) = ng) 


can be computed from the hypotheses. It follows from the Caratheodory 
extension theorem that P o N~! is uniquely determined. Regarding the 


generating functional, it is easily checked that for f = > ajla;,@1,.-- 54k 
in R,,A,,...,A,z pairwise disjoint in Æ, we have 
k , 
Efexp-N(f )] = | | exp -0 - e "HAi)) 
i=! 


= exp |- | (1 oo et auto] ` 
X 


This equality extends to all Borel functions f : X > R+. 
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Note that the construction of Q above gives us a Way of simul... 
process N: first draw the number Y of points with Poisson i ting the 
with mean @ = ||ul| and then draw independently a s ample butin 
with distribution lo = 87. A Points 
The framework of the proof of Proposition E.3.2 allows us 
compute the related symmetric probability measure y of Lemma po to 


Put 


P(w) = @ if Y(w) = 0 
= (Z,(@),...,Z,(@)) if ¥(@)=n>1. 


We have N = gy oP, and therefore 


[vall = m) = e° = 
n. 


Vn(A1X+++ X An) = P(Y = n, Z1€A1,. . ., ZnE An) 


Q” 
=e? T u” (Aix. - -xXAn). 


(b) Quasi-Poisson Fields of Nonoverlapping Spheres 
Recall that F is a subset of R? x [0, M]. Let u be a finite positive measure 


on (F,.¥ ) and v be the probability measure on (F*,.F*) whose restriction 
to F” is 

ê 
V, = a lgu” 


(c is a normalization constant). An F*-valued v-distributed random vari- 
able P is called a quasi-Poisson field of nonoverlapping spheres. 


Proposition E.3.3 Let R be a Poisson point process on F with inten- 
sity measure ji and let p be the symmetric probability measure on (F', F) 


such that p o @;' is the distribution of R. 


Put 


D 


ae 
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V 


T Z) Pla. 


proof For any n E NA in Z 


Onl (Ai X ...X An) 


nl J apea, "GD Bn) dho(é). dus) 


e 8 


e 
= Vali X ++: X An), 


while pØ) = e’, Vo({@}) = c. As |Ivl| = pl] = 1 6 
and the result follows. lll = 1, we get e® = c age 


(c) Simulation of Quasi Poisson Field 


The principle is to simulate a Poisson point process and reject the out- 


comes whenever they do not lie in Z. In practice, we first compute the 
integrals 


15 = I. Lay (S1,-- +> En) doli) -+ - do(En) 


by sampling with distribution Uo. This gives us the value of 


Numerically, we have IMAI! = 0 for n > Nmax, SO /1 TUNS from 1 to Mmax and 
elven n we draw n points. 


E.4 Computation of Physical Quantities 


We return to the equivalent Young modulus 


SIMULATION OF RANDOM pao 
ES 
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f E(x, y)(y — yaY dxdy 
E=“ 
Í (y-yc) dxdy 
5 
and the computations of E(E) and Var (E). 


(a) The integrals /,, are computed by simulation. Since n,,,, 
Sing low. 


not too large, we can speed up the simulation for each n by u 
discrepancy sequences. — | 
value is H,(61,...,6,) when 


(b) Let H be a physical quantity whose 
there are n grains with parameters &1,...,6,. Then 


-1 
9P 0” 
E(H) = ( >) D. I, Hn (Ei, ---+En)duo(Es) -+ dio). 


p20 n20 


We must now compute 
JG... Eo- -Endon daté) 


by simulation. This can also be accelerated with low-discrepancy 
sequences. It is obviously necessary to use a computation procedure 
for H,(G1,---,§n). If this procedure is lengthy (finite elements and so 
on), it becomes necessary to speed things up by using low-discrepancy 


sequences. 


Figure 3.5 on the next page shows like Figure 3.3 an application of 
recursive simulation explained in Section C.2. 
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Figure 3.5 Recursive simulation of a path of a two-dimensional Brownian motion. 


CHAPTER 4 


Deterministic Resolution of So 
Markovian Problems i 


In this chapter we show how methods from potential theo 
to solve numerical problems involving Markov processes -o be Used 
are generally faster than simulation methods; their applicabilin Method, 

HY must b 


examined before beginning a simulation. N 


A ELEMENTS IN MARKOVIAN POTENTIAL THEORY 


pen sets, and 


Let £ be a locally compact space with a countable basis of o 
ely, the cone 


let & be its Borelian o-field. &, and b& denote, respectiv 


of positive -measurable functions from E to R, and the vector space of 
bounded #-measurable functions from E to R. @ = €(E) is the Bind 
equipped 


space of real continuous functions on E vanishing at infinity, 


with the uniform norm ||.|. 


A.I Notation and Setting: Feller Processes 
We restrict ourselves to an introduction to Feller processes; for a more 
detailed account of the theory we refer to Meyer [1967] and, more recently, 
to Williams [1979] and Dellacherie and Meyer [(3)]. Other references are 


given in the comments. 
Definition A.1.1 A sub-Markov kernel P on (E, @ ) isa mapping from 
Ex £ into [0,1] such that 


(i) Vxe E & 3A — P(x, À) is o-additive; 
(in VAEe 4 Eax—>P(x,A) is E -measurable. 
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LÉ then Pf is the element of b& defined by 
E kd 


ff 


Pf(x)= | FOP dy), Ver 


| if js a pounded measure on (E, Æ), then HP is the bounded measure 
d by 
define 


uP(A) = [re auas, VAe &. 


pefi nition A.1.2 A semi-group (P;)i>9 Of sub-Markov kernels on 


E g) is called a Feller semi-group if 
(A.1.3) Vf e GE)P,f € CE) 
(A.14) Vf € CAE) lim ||P, f -f|| = 0 


Formula (A.1.3) says that P, acts on the vector space % (E), and (A.1.4) 


expresses the fact that (P;) is a strongly continuous semi-group of linear 
operators on the Banach space @(E). 


A semi-group of sub-Markov kernels on (E, £) satisfying (A.1.3) will 
also satisfy (A.1.4) provided 


(A.1.5) V e GE), VreE, Py f(x) = f(x). 
This is a consequence of the important notion of resolvent family. 


Lemma A.1.6 Suppose (P;): satisfies (A.1.3) and (A.1.5). For every 
bounded Borel function f, the mapping 


G > Pi = [OPa 
8 .Z(R,) ® &-measurable. 


P roof This follows from the fact that, for f € Yo, Pıf € Co and 
‘~ Pf (x) is continuous on the right. U 


fa semi-group satisfies the conclusion of this lemma, it is said to be 
measurable, 


finition A.1.7 The resolvent family of the measurable semi-group 
SUb-Markov kernels (P,) is the family of positive kernels given by 
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8 


Uf Œœ) = | euP fo dt, X>0. 
0 


A.1.8 Properties 


e VA >0,AU,1 < 1 and [AU f|| <||f|| Vf € bE. 


° Uz = Up + (u — À)ULU u A,u>0 
This is the resolvent equation. 
e The linear operators U}, U u» Pt commute. 


s TE yasta eU 
If there is a Borel function f > 0 such that [> P, f(x) dt < +V e p 
the keme U = Uy = Io P; dt is said to be proper. U is the potential 


kernel of the semi-group. 


Proposition A.1.9 Suppose (P,), satisfies (A.1.3) and (A.1.5). Then 
VA > 0, the space U}(#%5) is dense in G, and (A.1.4) is satisfied. 


| Proof Clearly, U\( fo) c and, by the resolvent equation, the Space 
U,(@o) does not depend on À > 0. Now let u be a bounded measure on 


E and let f € €. By (A.1.5), Vxe E 
AUIS Œ) ater F(x) 


_ and AU} f(x) is bounded By fl. Thus 


frere -0> [rau =o 
and, by the Hahn—Banach theorem, U,,@ is dense in #1. 
But the set {f:||P;f —f|| —;_ 0} is closed in @ (Exercise). Then, 


from the easy fact 


VP € UME) = Pif >of in Co 


it follows that E 
Vf E Co Pi f 10 f in Co. 


A.1.10 | 
We will use the following classical results: Let E’ = E U {5}, where ò is 


an extra point added to Æ, and let £” = SU {BU {6}:B e &}. We define 


á X 
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IEN 
EN | 
j group of Markovian kernels on (E, £’ 


b 


Piepert- P,(x, E), 
P,(8, A) = 14(8), 


xe Z 
Ae &. 


xtension is still denoted by (P,). 
w e ae E”. for the set of positive & ’-measurable functions. 
e 


| 
orem A.1.11 There is a realization of the semi-group (P,), which 
aie ous on the right with left limits, with life time, and strongly 
: conti 
Markovian. 


re explicitly, let 2 = {@: mapping from R, into E’ continuous on 
Gail with left limits such that Vs @(s)=5 = (@(t)=8 vr > s)}, let 
nae the coordinates, 
t 


FI = OX, Lt), and PQ = O(X;,t > 0). | 


Then for any probability measure u on (E, &) there exists a probability 
measure Py, on (Q,.F2) such that Vs < t 


ALTI : Yf Ee G EvlfX)F 91 = Pis f (Xs) Py- as. 


It follows by the monotone class theorem that VO < ty < fy < --- < tn, for 
all bounded Borel functions f;,...,f, on E’, 


Elf) fn) = | nawf Pandai an 


| Piece Oni hi nn): 
E’ 


The shift operators 6, on Q are defined by 


ows) = atts) VOER. 


5: is measurable from (Q,.F2) into itself. 


oo 


erty: for 
he Property (A.1.12) can be strengthened to the Markov property 
“Very bounded .7 ? -measurable F 
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~ , = Ms 
(A.1.13) EF 0 0| F ?]= Ex, [F] P,- as. 


Notation 
Ep, Pu, Ex, Py denote here the expectation (resp., probability) 
initial measure is u (resp., Ô,, the Dirac mass at x). when the 


Strong Markov Property. Let G, = F n = NF? and ket SE 
(“;)-stopping time. Then oy 


(A.1.14) Eulf Xs+t)1{s<+00}| Fs] = Prf(Xs)1 (5400) 


(in the sense P,-a.s. for every u) for every bounded Borel function f 
E’. This can be strengthened to ie 


EulF o O51 s<+00}| Fs] = Exs[F]1j 5400} 


for every F bounded .¥2-measurable function F. 


A.1.16 Remarks 
The classical notation is as follows: .FE is the completion of F? with 


respect to Py, 


Fo = Nu FE, Fp = FIN (Py-negligible sets of FH) 


and 
CASE F. 


It can be shown that the o-fields .7; are continuous on the right (7%; = 7) 
and that F; > .F M The strong Markov property holds under P, for the 
filtration (7° Cy. These subtleties are necessary for the entrance times to 
be measurable: if A e € then the random time 


D,=inf{t20:X,€ A} (+, if{.} = ©) 


is not in general an (77? )-stopping time, but only an (.F;)-stopping time. 

These technicalities have no real consequence for the purpose of sim- 
ulation, since every (7 t) -stopping time is P,-almost surely seer 
an (779 )-stopping time (see Dellacherie and Meyer [(3), Chapter AY: 


p.178]). 


oo NNO 


y 
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pte be Borel and bounded. By (A.1.12) 

Le tingale for s St. sete FE) An FD 

m 


A22 A : 
er then (f(X1))i20 is a s | | 
es PS upermart 
i Payu if and only if P,f <f Vr> ae with respect to (7? ) 


A2.3 


e Uy f (Xs) = Ey f e'F(X,) dll À = f e™f(X,)du. 
0 


A.2.4 
Let T be a (Y;)-stopping time. We put X. = 6 and from now on adopt 
the convention that if f is.a function defined on E, f is extended on E’ by 
putting f (5) = 0. | 

For f € b& and for T (-7;)-stopping time, let 


Prf (x) = Ex revo f(X7)] 
Ph f(x) = Ete f Xr). 


By the strong Markov property, if S, T are (S)-stopping times such that 
U=T+S 0687, then 


Vi>0 Pr f=PhP$f. 


A.2,5 
We can extend properties A.2.3 and A.2.4 to universally measurable func- 
tions (the o-field of universally measurable subsets of E’ is the set of 


by 4°) by replacing the filtration ($= F5.) with (Fi). ee. 
a A universally measurable function f from E to R, is called -excessive 
20) if 


: -Atp ¢ =F 
vr>o fre Pf and lime Pf =f. 
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A universally measurable function f from E to R, is sai 
R SA ù + 10 SE 
median (À > 0) it Said to be}. 
UDe 
hy 


Vu =0 Oyu f SF 
it is said to be semu-group-h-supermedian if 
Vr20 eR SL <f. 


(This notion is more restrictive than the preceding one, as is S€ 
grating with respect to e~*‘ dt.) » sen by inte. 
The existence of a realization with sample paths Continuous on the ~ 
shows, in particular, that the bounded functions that are continuous © right 
right on the sample paths and are semi-group-A-superme di re ON the 
sarily A-excessive. Neces. 
For example, the constant functions extended by zero at & are A-exces 
sive, for all À > 0. It can be shown [P.A. Meyer (1), Chapter IX] that i. 
A-excessive functions are also the À-supermedian functions, which satisfy 


f = lim Unf. 


A.2.6 Hunt’s Theorem on Reduced F unctions 
For a Borel set A, T4 = inf{t > 0,X, € A} is an (‘F;)-stopping time. The 


lower envelope of A-excessive functions (A > 0) that dominate on 4 à 
À-excessive function f (i.e., inf { g: g A-excessive, g > f on A}) is equal to 
Pe f(= Efe? Af (Xr, )]) except possibly for those points z € A such that 
P-(T4 = 0) = 0 (points of A that are irregular for A). If Uo is proper, this 


property is valid for A = 0. 


A.2.7 


The kernels 
TA 


Us g(x) = E | e o(X,) dt, 


0 


for À e Z, are the resolvent family of the process that is killed when 1t 


hits A. 
ndary. 


A.2.8 Example 
n. For 


Let G be a bounded connected open set in IR“ with Lipschitz ee 
Let A = (G)°, and let (B,) be an IR“-valued standard Brownian mo 


sécmrmernens esd 
aa aA 
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| > and € À S 1 
| continuous g € bE, the solution of the inhomogeneous Dirichl 
irichlet 


p robl em 


{ hu — Au=g inG 
lim U(x) = f(y) yeðG 


ven, for À 2 0, by 


is gi 
i T 

u(x) = E,[e TAF (Br,)] + Ey f i e™ o(B,) an 
0 


For a derivation, see Port and Stone, [1978]. 


A.2.9 
An additive functional is a positive process (4,) that is continuous on the 
right, (.77)-adapted, and satisfies 


Ats = À; + As O 6,. 


It is therefore an increasing process. The function 


un (x) = f Maa, 
0 


is called the A-potential of (A;). It is a À-excessive function (exercise). 


A.2.10 Example 
fhe b&,, A, =f, h(X,)ds is an additive functional. 


A3 Feynman-Kac Formula 


A.3.1 Generalized Resolvent Family 
Let he b£, We define a kernel Un by 


(A32) Un f(x) = It, f o` htv dsf(X,)dt 


forf e b£., If we write /, for the operator multiplying by 8 € bé, we 


have 
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AN 
(A.3.3) ie > Uy Iy-nU a)" VA 2 ||Al|.. 
n=0 


Proof We have 


Unf (x) = Ex |: e elo Ahk dsecy ) dt, 
0 


Putting u(X,) = À — h(X,). this is equal to 


co 1 t n 
ED fa (f uX) ds) eF(X,) dt 


sE; | u(Xs,) > 
2, O<S] < <Sn<t S 


aXe Eds ddt 
= ED fu, )U(X 11412) : U(X titty) 


n=0 
rere 
e it OU Pate) ddl. 


Using the Markov property (A.1.13) gives 


= JU, Uf. 


n=0 


n 


PROBLEMS 


E 


Consider a function k € b#, such that k > h. The same argument shows 


that 


(A.3.4) Un = > Url", 
n=0 


which implies that the positive kernels Up, Ug satisfy 
(A.3.5) U, = Ur + UnlenUn = Ur + Unlk-nUKk- 


This generalizes the resolvent equation. 


| 
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| 6 hat he b& E 
Ai suppose "3 + and A; = fh) ds. 
AF we put 


Of) = Ele“ f(x), 


dditive functional propert | 
by the a Property of At; we defin : 

n ; e b 

et group (Q:) with resolvent family (V,)),, sien by a sub-Markov 
$ 


Vi = Ush. 


suppose in addition that h € @. For a > À+ h, (A.3.3) gives 


(A.3-7) Vial D DNS UE 


n=0 


which shows that Va is a continuous operator on the Banach space @o. 


The inequality Una f < Uaf for f € bé,, shows that ||AV4|l < 1. 
Finally, for f € @o, definition (A.3.2) gives 


Aa fle) = Ex | | Fe Vere Ios) 454 (Xd | 
0 


which implies that AV, — f pointwise as À — co, and by the argument 
used in Proposition A.1.9, we obtain AVi f —> fin GoVrxGo € Co Vaio 
does not depend on À, Vao is dense in So, tf:AVaf > f in Co) is 
closed and contains Vy %0). 

We quote the result: 


Every contraction resolvent family (RAJ(IA Rall < 1) ona Banach space 
B satisfying À Rax Teo X Yx e B is the Laplace transform of a 
unique strongly continuous contraction semi-group on B. 


This is a part of the Hille-Yosida theorem [Dellacherie and Meyer, 
1987, Chapter VIII, Theorem 4] and shows that Q, is a Feller semi-group. 
t (A, ZA) be the generator of (Pr), that 1s, 


saree, ams eee 
Re 


rene 
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PMs 


| | P,f — 
DA = fy € 6o: lim ZA f isi in Go} 
t—0 t 


and for f € JA Af= lim, (Pf -f)/t. 
We again consider h e Co, where h > 0. The generator (B, ZB) of Q) 
t 


satisfies 
(A.3.8)  ZB- GA and Bf= Af-hf Whe DA. 


Indeed, it is easy to see that ZA = U 160, which does not depend on 
À > 0, and by formula (A.3.4) Vi, % = Ux Go. But if f € ZA, then we 


have 
1 ee 1 
7 QS) - f&)) = Ex [ræ | +7 EL) -Sf 
which converges pointwise to 
—fQ)AQ) + Af (x). 


Thus the limit in @ exists and is equal to Af- hf. 
To sum up, we have the following result, which is known as the 


Feynman—Kac formula: 

Let h € @ and let A, be the additive functional fi h(X,) ds of the Feller 
process (X;);>9 with semi-group (P,) and generator (A, DA). 

Let (Q;)i0 be the semi-group defined by 


Of(x)=Exle“fX)] fe %. 


Then (Q;) is a Feller semi-group with generator (B, WB) given by 


ØB=ZA and Bf-Af-hf Vfe ZA. 


In its various guises the F-K formula is ubiquitous throughout much of quantum 
physics on the one hand and probability theory on the other. ... 

ical 
I find Feynman’s formula to be beautiful. It connects the quantum none 
propagator, which is a twentieth-century concept, with the classical mec 
of Newton and Lagrange in a unique compelling way. 


Í 


r 
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aa 24 
AY 1 
pal | | i 
fy © m formula aaa PESAR but it, (O0, spans two centuries: the differenti: 
My a my formula solves is nineteenth-century ential 


gain, ye Se re while the Wiener integral is 
a CT 
43? ane d d d 
A. Le a bounded connected open set in R? with Lipschitz b 
G un EE ounda 
oe HAS (G). Let f,g,h be in &(R%),h > 0. The solution of he 
problem 
{ hu—+Au=g in G 
lim u(x)=f(y)  ye dG 
is given by 


ejo h(Bs)ds 4 — fi h(Bs)ds 
ux) = Exle FAT) +E, [ hiBods (B, di]. 


(For an extension of this example see Friedmann [1976] and Port and 
Stone [1978].) 

Under suitable assumptions, we can extend to h Borel or even replace 
h by a measure [Feyel and Pradelle, 1988]. We can also take h to be 
complex, which enables us to interpret the Feynman integrals in quantum 
physics. See, for example, Hu-Meyer [(1)] and the references in it. 


B BALAYAGE ALGORITHMS 


We tackle only the elementary theory of balayage (or sweeping), which 
concerns functions. The general theory deals with balayage of measures or 
of the corresponding additive functionals [Dellacherie and Meyer, 1985, 
Blietner-Hansen, 1986]. 


B.1 Poincaré’s Balayage 


Henri Poincaré discovered that knowing the solution of the Dirichlet 
Problem for spheres and knowing the uniqueness of that solution allow 
u$ to solve the problem for open sets of general shape, provided that 
"y have sufficiently smooth boundary. This idea lies behind the local 
Xlomatic potential theories, [Brelot, 1966; Bauer, 1966; Smyrnelis, 1976]. 

ùr main interest is the algorithmic nature of Poincaré’s method. For a 
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EM 
§ 


al 
es see Bre 
Ot 


h 


function on th 
€ Sphere S(0 R), that i ry 
Fe > À), that is the bound 
enter 0 and radius R in R”,n > 2. Then ihe ae ou Bo 
( > 


harmonic in 
B(O, R) and converges to f at the boundary, is giv > Which ig 
, en by 


1 
O,R 


R? — |x|? 
71% 0) dog), 


s ly-x|" 


H*(f)(x) = 
where © is the area measure on S i 
, and Œ, is the ar 
Qn = n(n” /T((n/2) + 1)); that is, a saa 


TP 2 
RENE ET 


(04 = 
DE ae 


Similarly, the function K°(f) defined outside B(0, R) by 


F 7 eo |x|? — R? | 
K°(f)(x) = OR I Ppa xpe VAN 


is harmonic in the open set (B(0,R))° and converges to f at the boundary. 


We have 
Pleo < Ifl WK? Ale < IFI- 
and 
limK*( f)@) = 0 if n23 
do(y) . E 
zi. LO if n= 2. 


to be continuous superhar- 
d, for every 


Definition B.1.1 A function f is said 
satisfies 


monic in an open set G c R” if it is continuous in G and, 
closed ball contained in G with center x and boundary S, 1t 


fx) > 1/068) ff )d60). 


á > 
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| 24 
pemma B.1.2 Let f be continuous in hie ns 3 
€ a 


(i jf, for every x € G, there exists R@®)>0 st 


| 1 
fœ) z a(S) [ro do y) 


for every sphere S that is the bound of 
j f | ary Of a closed b i 

in G with center x and radius r < R(x), then fis FRE contained 
harmonic in G. nuous super- 


(ii) Let B be a closed ball contained in G with boundary S. The func 
tion if , obtained by replacing f with its Poisson integral HS (f), is 
continuous Superharmonic whenever f is superharmonic 


We refer to Brelot [1959] for the proof. 
The function f is called the balayée of f outside B. (The word comes | 
from the fact that it can be shown that the positive measure -Af is obtained 
from the positive measure —Af by suitably distributing the masses from B 
onto the boundary S of B, without changing masses outside B or the total 
mass.) 


(b) Balayage in an Open Set Satisfying the Outside Sphere Condition 
Let Q be a bounded connected open set in R”. Consider a continuous 
function f on the boundary dQ of Q. We are looking for a function H(f) 
that is harmonic in Q and tends to f at the boundary. The uniqueness 
follows from the fact that if a harmonic function in a connected open set 
reaches its maximum inside the open set, then it is constant on the open 
set. 

The main problem is therefore existence. It is sufficient, for this pur- 
pose, to consider the case when f is the trace on dQ of a positive func- 
tion F which is C? on R” and satisfies AF < 0 in a ball A containing 
Q. Indeed, the continuous function f on the compact set dQ. is a uniform 
limit of polynomials p,. If the functions H( pn) exist, then by the prop- 
erties of convergence of harmonic functions [Brelot, 1959], the functions 
Hp) converge uniformly in Q to a harmonic function, which must be 
H(f). But every polynomial is the difference of two polynomials p such 
that p > 0 and Ap <0 in A. 

Now consider a covering of Q by 
boundaries contained in Q. We enum 
vided that every ball occurs infinitely | 
Order B,, B>, B, B», B3, By, B2, Bs, Ba, _…LetB, 


open balls B1,B2,...,Bk,... with 
erate these balls in any order, pro- 
many times, for example, in the 
B?, B?, B*,...,B",--. be 


n~ 
> Of Positiy ide py | 


Fa is harmonic in £2. Th 
é a us Suppose in addition that it is Possible to put a sph i 
i . ; 
H contained in A and contains the point y e JQ Sa Outside 
‘ : t 


and hence 


F is sandwiched between two continuous functions that coincide at y and 


is therefore continuous at y. 
Thus if the open set Q satisfies the outer sphere condition at every point 


of its boundary, then F.. solves the Dirichlet problem for F. 


B.2 The Harmonic Kernel of a Markov Chain | 
Let (E, €) be a measurable space and let P be a sub-Markov Kernel on 
(E, & ). The potential kernel of P is defined by 


Gz 2r (P° = 1). 


For A e &, let /4 denote the operator multiplying by the indicator function 


14. We define 


oo 


Gi= > PL)" 


(BAZAT 
n=0 


) be a realization of a Markov 


ing, as usual, an extra point 6, let (Xn ee 
sent 5} and transition P. (We adopt the conven 


in with values in EU { 
fur = 0 for ue b& and X» = 6.) We introduce the kernels 


D" 


ce 
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m Hau(x) = E,[u(Xr,)], uc bE 
B27 2 

TAP inf {n 2 0,Xn € A} is the hitting time of A. 
Wie get 

HauQ) = Ex Duta een i Mets] 
n= 
= u(x)14(x) + D Ple ac P (Lau) 

and 50 
829 Hal: + LacGacP1a. 


et us state some properties of the kernels H4: We say that u is excessive 
for P if u is &-measurable from E to R, and u > Pu. 


e yke N,v =u ^k is excessive and finite, and so v(X,) is a positive 
supermartingale and Hav(x) = Exv(Xr,) < v(x) = E,[v(Xo)]. Letting 
kteo, we obtain 


(B.2.4) Hausu Vu excessive. 


o Let @ denote the translation operator of the Markov chain (X,). We 
have Ta = 1+T4 00 on {T4 > 0}; hence by the Markov property, 


Hau(x) = 14.u(x) + Ey(u(X7,4)174>0] 
= 14.u(x) + E,[Ex, [uXr,)l17,>0] 
= 14.u(x) + Lac PHau(x) 


and so 
(825) Ha= lat TacPHa. 


° (B.2.6) If u is excessive, then v = Hau is excessive. o 
(B.2.4), Py < Pu < u, which equals to v on A. By (B.2.5) we h 
v= Py on A°, and thus v < Pv everywhere. 

* (B.2.7) If w ie excessive and w 2 u on A, then ay ee 
measures H4(x,.) are carried by A, and so Hau Sw EVE y 


| Applying P on the left in (B.2.3) gives 
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PH, = Pl, + PlacGacPlTa = GacPls 8 
and hence 
(B.2.8) PH, = GacPla. 
Now the general identity 
G(P; + P2) = G(P1) + G(P1)P2G(P 1 + P2) | 


between potential kernels [Dellacherie and Meyer (2)], applied 


. Wi 
P, =I 4cP and P, = 14P, gives Ith 


(B.2.9) G = HAG = Ilac Gace. 


We have thus proved the following theorem: 


Theorem B.2.10 There exists a sub-Markov kernel H4 such that for 
every excessive function u,H,u is the smallest excessive function larger 
than u on A and 


(i) Ha = I4 + TacGacPla 
(ii) Ha = T4 + Tac PHA 
(ili) PH4 = GacPla 
(iv) G= H4G +lacGac. 


Corollary B.2.11 (Domination Principle) Let v be excessive and let 
f 20 be measurable. If v2 Gf on {f > 0} then v = Gf everywhere. 


Proof Let A= {f > 0}. From v > Gf on A we get Hav > HaGf. But 
by part (iv) of the theorem, H4G f= Gf and hence 


v 2 Hav 2 HaGf= Gf. E 


B.3 Numerical Balayage 


A classical problem in geodesy is the leveling problem. 

E is a finite set that can be assumed to be contained in IR’. For every 
x € E, differences of altitude h,—h, = Ory, Oxy = -Oxy have been measured 
between x and some other points y € E (neighbor of x or not). But these 
measurements are not compatible because of the errors; that is, there exists 


| 
| 


ner nine rte it 


7 


| 


AYAGE ALGORITHMS 
pal’ 
| punction h from E to KR such that i 
p 
h(x) — h(y) = 
= Oxy Vx, y. 
choose a quadratic criterion 
pe , then we must find an h whi i 
l ec which minimizes 


> (A(x) — ACy) — Oty), 


(x,y)e M 


where the Dites relation M c E? denotes the a | 
measurement as been taken. The sum can also be Re which a 
ith weights 


(B.3.1) >, q(x, Yh) — hY) — y). 


(x,y)e M 


We can make arguments for such a i 

i quadratic criterion b ; rr 
ae ae ae ASSURE that the errors are small and Se us 
1912], but this 1s not a proof, as the celebrated aphorism ab ia 
squares method says: about the least- 


Tout le m j nes 
Pons y croit parce que les mathématiciens s’imaginent que c’est un fait 
ion et que les observateurs que c’est un théorème de Ba 


[Henri Poincaré, 1912] 


We take the form (B.3.1) as i j i 
we have ) as a starting point. Putting M; = {y: x,y) € M}, 


(B.3.2) > g(x, yA) — h(y)— dy)=0 VxeE. 
yeMx | 


We now consider the stochastic matrix 


(B, à 
3.3) P(x,y) = 40y) P(x,y)=0 if yd Mx, 


Everyone beli LS te a f i 
ieves it, because mathematicians think that it 1s a fact of observation and 


0 Sery " 
ers n i $ > 
S think that it is a theorem of mathematics. 
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OBL 
then equation (B.3.2) becomes Mg 


(P—Dh= —B, 


where 


B(x) = ( >. a } / > q(x, y). 


yeM, yeM, 


Clearly, to construct a relief map, we must also use boundary condi 
So we suppose that the values of h are given at some points in 
we call the boundary and denote by dE. The problem is then fo 
satisfying 


tions, 
E that 
find } 


(P—Dh=-B inE\0E 
h=f in dE. 
The matrix P(x, y) in this problem is used only for points in E\OE, so for 


convenience we change the definition of P, keeping (B.3.3) for x e E\QE 
and putting 


(B.3.4) 


P(x,x)=1 
Px, y)=0 y#x 


for x € OE. 


B.3.5 The Homogeneous Problem 
We consider first the homogeneous problem 


(P—-Dh=0 inE\ðE 
h=f inðE. 


Let no be the function defined by 
no = II flleole\ae +f lar: 


this clearly satisfies Pno < No. 
We define functions n, recursively by 


Na+ = Knnn 


araneae i ter 


ORITHMS 
yAGE ALG 
pal’ 
erator K,, is 
where the OP n defined by 249 


Koz 
n& AN + Pa), } 


x,) is à sequence of points of E | 
d ( i Fe \dE that Contains every point i 
infinitely ye EGE 


Lemma B.3.6 If § is a function from E into R such that 
PE SE, 
then PRES Kn&. 
proof (a) From & > PẸ follows that, outside cat 
KE > PE > PRE. 
(b) At Xn, 


KR) = PEG) 2 PKyS(Xn). 


The sequence Nn is decreasing by the lemma and is bounded below by 
-|| fllo. Let No be its limit. Since Na > Pnn, we have n 2 Pe. Now 
at x, we have Nn (Xn) = Pn,(xn), and so for any fixed point x in E\dE 
there are infinitely many indices n such that Na+ (x) = PNn(x). Thus Ne is 
also the limit of the subsequence of the Nn satisfying 


Nn x) = PY n(x), 
and Na satisfies 


Noo(X) = PY 20(X) Yx e E\E. 


But n..(x) = f(x) Vx € dE, since N(x) = f@ 


IS a ] ) 


; : unique. Unique- 
With the assumptions above, this solution may an ee ae 
ness can be proved if Vx € E\dE the pronio ee positive. 
With transition P starting at X hits the boundary 
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B.3.7 The Inhomogeneous Problem un | 
We now turn to the problem q 
has P-Dh=-B inE\QE 


Putting B = B* — B7, we may assume that B 2 0. Suppose we p... 
a function &o such that nave foung 


| 
| 
h=0 ind. | 
| 
| 
| 
| 
Eo = PÉo + B 


and 
Eo = 0 on OE. 


The algorithm | 


Chad = To 


with J„ defined by | 
Jng = S1E\ {xn} + (Pg(xn) + Bn) 1 bn} 


for a sequence (x,) of points in E \dE that contains every point in £\9E 
infinitely often, converges to a solution €.. of the problem (B.3.8). 

The argument is similar to the preceding one. A sufficient condition 
for (B.3.8) to have a unique solution is that there exists y > 0 satisfying 
y- Py > 0 in E\oE. 


B.3.9 Comment 

Such balayage algorithms are easily implemented and the property of 
monotony makes them especially efficient. They can be generalized to 
some nonlinear operators P. 


C REDUCED FUNCTION ALGORITHM 


C.1 Reduced Function Algorithm with Respect to a Sub-Markov 
Kernel 


| 
Let (E,“) be a measurable space and let P be a sub-Markov keme 
on (E, @ ). A measurable positive function f is called excessive for 


f>Pf. | 


ON ALGORITHM 


puce? ii = 

£ 

R gnition C.1.1 Let u be a bounded measurable function. The func- 
a defined by 

yo" 


Ru = inf {f excessive, f > u} 


jed the reduced function or réduite of u. 
e 
js 04 

jon C.1.2 Let u be measurable and bounded. If uo = u* and 


proposit ae 
Pun, then the sequence u, is increasing and converges to Ru. 


= Un V 
Unt! 


proof Let w = SUP, Un. We have 


Pw = sup Pun < sup un = w, 
n n 


so w is excessive. 


X w, if v is excessive and v 2 u, we have v 2 u* and v 2 un > Pv 2 

0 | 

P Hence y > Un V Pun = Un, and so v > w. [| 
Un: 


Corollary C.1.3 


(i) We also have Uns; = UV Pun. 
(ii) Ru is measurable. 
k a se set {u < Ru} U {u < 0}. 
Proof | (i) Let vn = UV Pun. Then (vn) increases with (un), and if 
Un = Vn then 
Vai = UV Pun S Vn V Pun = Un 


< v Pvn = Val 
Un+1 = Vn V Pvn SVn+1 n 


iii f art (1). 
Part (ii) comes from Proposition C.1.2, and part (iii) follows from p 


<0}. 
(iv) By (iii), we have Ru = PRu on {u < Ru} U lu 
e j Sduite operator 
We i some simple properties of the (nonlinear) réduite op 
R: 
“he, )< Rut Rv) 


) subadditive (R(u + 


aed < Rv 
* Ris increasing (u < v > Ru SK _ tRu,t € Re). 


and positively homogeneous AE ion C.1.1. 
* If u, T u then Run RARE by Doun 
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® — y 1 | 

If lu v| < h for noe excessive function h, we hay BEM | 
hence Ru < Rv + h: similarly, Rv < Ru + h. Thus Ry Vh | 
Ry| < , | 


particular, this is true when h is constant. 
Ih 


C.2 The Filling Scheme and the Réduite Property 


a us consider the action of the réduite algorithm (Proposit; 

7 unction u of the form u = Gf, where GISI < +00, We Fo CI. Jo 

the nonlinear operator v — v v Py. As usual, we let f+ _ a be 
“SAUS, 


f= —f li ¢<0}. Then 


Q- P)KGf= (I - P)\IGfv (Gf-f)] =- PXGF+f") 
PUR 


Hence the operator S = (7—P)KG acts on functions f such that G|f| 
by sweeping by P the negative part of f. S is called the filling Bas i 
Clearly the sequence (S”f)* is decreasing and the sequence (S'y a 
Sat. 


isfies 


STY SPA) 


and so, tends to zero since G| f| < +co. 
If we put A = lim, S”f = lim, (S”f)*, then O < h <f* and 


Gh = limGS"f = lim K”G f = Ru. 
n n : 


We have proved the following proposition. 


Proposition C.2.1 Let u be a bounded measurable function of the 
form u = Gf, where f satisfies G|f| < +c. Then Ru = Gh for some h 


such that O<h<f*. 
Corollary C.2.2 If u and v are bounded and excessive then u-R(u-)) 


is excessive. 

Proof (a) Suppose first that PI < œ < 1. Then w = GU - PW z 
bounded w, and thus u = G f and v = Gg with f = (I — P)u and g = EY 
which are positive. By Proposition C.2.1, R(u — v) = Gh, with 0 SAS 
(f — g)*, and hence h <f and u — R(u - v)=G(f - h) ue de 

(b) We assume only that P is sub-Markov, and write Ps wd ae 
10, 1[. Then u —% R(u — v) is “P-excessive by part (a). When Q ! ” i 


saa ape RU = 
easy to see that “Rw T Rw for any bounded w. Since u R( 


ce? AUNCTION ALGORITHM 
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r B < a, the limit u — | 
gp exce ive - pe ns as R(u — v) is still BP-excess: . 

true FOF o » IL, SO u — R(u — v) is P-excessive es 
is 

‘qui erty stated in Coroll 
e réduite Proper)». rollary C.2.2 w 

The to be characteristic of the cones (such as aoe ph G. Moko- 
ons With respect to a sub-Markov kernel P) from Sick excessive 

can be constructed [Mokobodzki and Sibony, 1968: M Senn 

, ` Z i, 


C3 Optimal Stopping 


| first the idea of an essential sup of an eventually uncountable family 


al 
Fe dom variables. 


of rani 


pefinition-Proposition C.3.1 Let (Y;)i-; a family of measurable func- 
ons from (Q,F) into Re .# (R). There exists a random variable Z 
unique up to P-a.s. equality such that 


(i) Z 2 Y; as. Vie Í; 
(ii) for any random variable H such that H > Y; a.s. Vi € I, we have 
H2Z as. | 


The random variable Z is denoted ess sup... Yi. There exists a sequence 
in J such that Yi, > Z P-a.s. The sequence (Ÿ;,)10 can be chosen 
increasing if the family (Y ie has the following stability property: Vi,j € 
Lake lst. Y2 Yiv Y; a.s. 


See Neveu [1972] for a proof. 
The idea of réduite possesses a probabilistic interpretation in terms of 


optimal stopping. We give only an introduction to this large area. For a 
deeper study we refer to El Karoui [1981]. | 

Let (X,) be a Markov chain with values in (E, €) and transition prob- 
ability P. Let u be a bounded positive measurable function such that the 
present benefit of some decision at time 7 is represented by u(X,). We are 


interested in evaluating 


sup ETAT) 
TET 


Where Fis the set of bounded stopping times for the filtration -7n = 


m, M < n). 
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Proposition C.3.2 


sup Ey [u(X7)] = Ru(x). 
TET 


Proof (a) Ru is excessive and Ru > u, so the se 


Super-martingale and Ru(X,) 2 u(X,) for every n > g rige Ruy ») 


(b) Let Y be a supermartingale such that Y,, > u(X 


notation of C.1.2, te Using th 
Yn 2 E[Y m EA Yn, 2 Elur XF] = Puy(X 
Yn uk(Xn) Vn hence Y, > Uk+1(Xn) 


Therefore, by induction on k, 
Y, 2 Ru(Xn) Vn=0. 
(c) Since u < Ru, we have 
E [u(Xr)] < Ex [RuX7)] < Ru(x) 
because Ru(X,,) is a positive supermartingale. Hence 


sup E,[u(X7)] < Ru(x). 


res” 
(d) For the converse inequality, we put 
Gi =NTE 7 Len} 


V,=ess sup E[u(X7)|-Fy]. 


Te% 
We check first that the family of random variables 
(EUX) SLT € Zn} 

is stable by taking the sup: (consider T = 14c.Ti + 14.72 with 


A = {E[u(X7,)|-Fa] < Elu(Xr,) 71): 
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ws the existence of a sequence (T” mn 
nis sho (TPiz0,T" € F,, such ren 
EUX TZT v,,. 
ghen 


ELV n| Fn) = lim T EUX pe) Fad Vp. 


it follows from the preceding argument applied to the Supermartingale 
Va that 


Ru(Xn) SVa Vn2z0 


and hence 
Ru(Xo) < Vo = ess sup E[u(Xr)|. 7]. 
J 


Taking the expectation Ey gives 


Ru(x) < Ex lim T E[uXy0)|-Fo] = lim Ex{u(X7o)] 
< sup Elu(Xr)]. 
TE.7 


So, finally, Ru(x) = supre 7 Exlu(Xr)], which proves the proposition. L 


C.4 Computation of Reduced Functions with Respect to 
Semi-groups 

Let (P,) be a Feller semi-group on (E, € ) and let (U)a>0 be the associated 
resolvent family and (X+), (7 0), P,) be the associated Markov process, 
as in Section A.1. 


C.4.1 Notation | 
The cone of bounded supermedian functions 1S 


C, = {ue b&,; WU usu VA > O}. 


The cone of bounded semi-group-supermedian functions 1S 


Cy = {ue bé P,u <u Vt > 0}. 
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The cone of bounded excessive functions is PBL, 
C3 = {ue C2; lim P,u =u}. 
For u € b&,, we define 
R,(u) = inf{v € Ci; v2 yu}, rl, DS 
Let u e b&, and let R®u be the réduite (Definition C.1.1) with espe 


to the kernel AU. 


Proposition C.4.2 


Vue b&,, R,(u)=supRu € bE, 
-À 


Proof Let ID, = {ve b&,:AU, Vv <v}. It follows from 
UU y — AU, = (u -ÀU (I - UU), À, > 0 
that T} decreases as À increases. Hence R®u increases with À. Now 
w = limh_ Ru is measurable and belongs to T, Vu, so w € NND =C. 
Finally, f € C, and f 2 u clearly implies f = w, and so we have 


w = R, (u). . O 


Lemma C.4.3 The cone of bounded excessive functions with respect 
to the resolvent, that is, 


Cy= {ue Cin es u}, 
+12 


is equal to C3. 


Proof (a) If u €e C3, then 


Maut) = | Ne ™Pu(x) dt. 
0 


Since lim,_,o P;u(x) = u(x), the fact that the measure Ae“ dt ae 
narrowly to dp as A — © gives limy_5.. AU, u(x) = u(x) and so 4 € &* 


d 


| 
| 
| 
| 
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b) we refer to Meyer [(1)] for the converse. 


proposition CE TN Maltby Rec 3. Then 


R, (u) = Ro(u) = Ra), 


proof The réduite property (Corollary C.2.2) applied to the sub- 
Markov kernel AU), gives the réduite property for supermedian functions: 


(*) Veger foRGS =z eC. 


indeed, using the notation of the proof of Proposition C.4.2, we have 


f-RÔ(F - g)e Ta. 


Letting à tend to infinity on the left hand side, and using À < u > Ty CT 
d Proposition C.4.2, we get 


f-Rif -g)e Li. 


But Ci = Nala, so (*) is proved. 
Now put vı = V- R,(v — w) and v = Ri(v — w), so that we have 
vy, € CE C, and v; +v2 =v € C3. Then 


vı 2 lim AU: 
À—>00 


V2 2 lim AU A V2 
À—c 


and 


4.3). 
Vip +v = Jim AUi + V2) (by Lemma C.4.3) 


But, by Lemma C.4.3, this implies 


i -= li yy andso v,V2€ C3. 
"sS lim AU: and v= Jin ABa 2 
i ies R.(u) < R(u) S 
The Proposition follows, since C1 > C2? C, implies Kı £ 
R3(u), 
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Lett e RÝ. For u € b& 
respect to the kernel P,. 


+, We write Ryu for the réduite a $ 
u * 


With 
Definition C.4.5 A function u € b& is said to belong to the 


of the extended generator of the semi-group (P;),so if there exist Main 
function v such that i Borg 


Uply| <œ and u=U,(pu- v), p>o0. 


In this case we write u € Z,A, and the function v is denoted by Ay 


C.4.6 Remarks 


e If u,v € b&, and u = U,(pu — v) for some p € ]0, 
Up(pu — v) for every p e ]0, ~f. 

e The function v is unique up to changes on a set of potential zero 
(A € & is of potential zero if U,14 = 0, p > 0). It is given explicitly 
by 


co, then U = 


v= lim n(u — nU,u) 
n—eoo 


or by 


1 
v= lim —(P,u—- u). 
lim | Pauw) 


These limits exist outside a set of potential zero [Mokobodzki, 1970]. 
e It is easy to prove (exercise) that for u,v € b& the property 


Uu = U,( pu = v) 


(for some p > 0 or for every p > 0) is equivalent to the property: 


t 
u(X;) — u(Xo) -Í v(Xs) ds is an (.F ?)-martingale 
0 


under P, for every x. 


| ‘acpired bY 
The following result and the remainder of this section are inspue 


Lamberton and Pagés [1991] to which we refer for a deeper study. 
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tion C.4.7 Letue Z,A, with Au bounded 
nded. 


¥1,t >0, Ryu — Rule < (tv t’)\|Aull... 


oof As noted in the preceding remarks. th 
X,)ds is a martingale that is bounded one 


t 
4 proposition C.3.2; 


€ process u(X,)— U(X 0) — 
very finite interval of time. 


Ru = sup E.[u(X EP 
(t) = s)] and Ry yu = us E.[u(Xs)], 


FJ, (resp., Zr) is the set of bound PAN | 
where 77 fr ounded stopping times with values į 
ihe set tN (resp., t/N). ith values in 
se Ir, let S = inf{s e tN:s > S’). Clearly S € Z and 0 < 
GS Sh = 
We have 


S 
E.[u(Xs5)] = E.[u(Xs-)] + E. | Au(X ;) as 
Ss’ 


and so 


sup E.[u(Xs5)] 2 sup E.[u(Xs)] — t||Aul|... 
SE. SET 


L 


The result follows easily. 


Proposition C.4.8 Let u e Z,A with Au bounded and f € R,. Then 


(i) R,(w) = Ro(u) = R3(u) = limyt.. T Rayon 
(ii) R3(u) — Ryu < t||Aulle. 
Proof The sequence Ry/2u is increasing with n. For v € C2, v 2 u, 
we have v > Romu and hence R2(u) 2 Lim foo R(r/2)U. 

By Proposition C.4.7, Vs,t > 0 


lim Rajaryu = lim R(s/21)U. 
nice nToo 
x denote this limit by w. We have w € bé x and VtP,w Sw, Sow € Cr; 
‘nd it follows that w = Rou, which proves part (i). 
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Part (ii) follows directly from 
Ru — Rayu] < t|Aull 


by letting n tend to infinity. 


The preceding results enable us to compute R3(u) for u in the . D 
[f € ZA; Af E b&} under the uniform norm (this space contains Ure o 
since the domain of the strong generator of (P,) is dense in E E) a Ci ‘a 
included in {f € Z,A; Af € b&}). 

Ifue {fe D.A;Afe b&} and ve {f € Z,A; Af e bE) 
that |v — u| < €, then |R3v — Raul < € (since, as seen in Section A ae Uch 
constant functions are excessive). By proposition C.4.8 we can com the 
R3v by choosing ¢ such that ||R3v — Rav] < € (this supposes we aok a 
bound for ||Ay||..). Finally, we can compute Rv by the réduite an a 
(cf. Proposition C.1.2 and Corollary C.1.3). i 


C.4.9 Application: The Réduite àt Finite Horizon 
Let F:E x R} > R be & &.2(R+,)-measurable and bounded, and put 


QF (x,t) = P,F(x,s +t). 


This defines a Feller semi-group on E x R4. 
We define 


Č, = {F > 0 measurable bounded: QF SF Vs>0} 
Č = {Fe Čz: limQ,F = F} 
s—0 


and let Ry and À; be the réduite operators with respect to these cones. 


Definition C.4.10 Let ue b# and T e R°. We put 


Ru = R3(uljo7))(., 0). 


Proposition C.4.11 Letu € Y.A with Au € b&, and define Ku = 
u V P,u. Then, 


R'u = un T (Krpn) 'u 


n Too 
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and 
n T 
RTu- (Kron) us zp lull 


f Set F(x, t) = u(x)Lio,r1()- 
re aus first remark that if RG denotes the réduite of the function 
es respect to the kernel Q;, then we have 
gw" 


(Rajam FX, 0) = (Krn) u. 


indeed, for t = T/2", we have 


QF (x, s)= Pulrorits +t) 


and SO 
FVO;F = Kiulior-nts) + ulir- Cs). 


Let K.F be the left-hand side. It follows by induction that 
t 


(K,)" F = (K,)? ul yo}(s) + (Ko? uloz) + aig 


and this function is Q,-excessive, which proves the assertion. 
(b) Note that if (Y;) is the process associated with the semi-group (Q), 


it is possible in proposition C.3.2 to restrict the sup to stopping times < E 


RE = sup E [FW s)] 
SEA, ST: 


since 


F(Ys) < FË sar). 
? h u- 
(c) It follows that if u € ZA with Au € b%, we can apply the arg 
ment of Proposition C.4.7 to get 


Ray - Rwy lle SUV t’)||Aul|.o- 


RER TS: z 
eo Se ; 


Y 
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Then, by Proposition C.4.8, 
ŘF = R3F = lim T RerpnF. 
nioo 


The result follows by taking the value at 7 = 0, and using the definition f 
RTu and part (a). ` 


D THE CARRÉ DU CHAMP OPERATOR 
There can be no doubt that the fountainhead, the inspiration, of all literature is the 
d the same is true about mathematics 


cal and social universe we live in, an 
d to change, in part because the problems change 


physi 
e. As we learn more and 


Applied mathematics is boun 

and in part because the tools for their solution chang 
more about the world, and learn how to control some of it, we need to ask new 
questions, and as pure mathematics grows, sloughs off the excess, and becomes 
both deeper and simpler thereby, it offers applied mathematics new techniques 


to use. [Paul R. Halmos, 1981] 


The idea of a carré du champ operator is derived from that of a generator 
ator can be defined in several 


of a Markov process. The notion of gener 

ways [Bouleau ( 1),(4); Dellacherie and Meyer, 1987], and the same holds 

for the carré du champ operator. | 
If À is the generator of a Markov 


The formal relationship is as follows: 
process, the carré du champ operator T is defined by 


I(f,2) = A(fg) - fAg — gAf. 


The definition of I involves the domain of A and the question of whether 


or not this domain is an algebra. 

We shall make use of the extended generator, which has already been 
introduced in Definition C.4.5. What was proved in the framework of 
Feller processes can be extended to right processes [Getoor, 1975; Del- 


lacherie and Meyer, 1987]. For the origin of the idea of carré du champ 
operator see Hirsch [1976/77], for the idea of carré du champ operator 
associated with a symmetric semi-group and its connection with the idea 
presented here, we refer to Bouleau and Hirsch [1991, Chapter I, Sec- 


tion 4]. 
Throughout this section, we consider a Feller process (X,)r20 With tran- 
the 


sition semi-group (P;)/20 and resolvent family (U»)p20- The notation is 


same as in Section A. 
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Domain 
i Littlewood Pale VIRE form by p 
de works On 1 —Paley Inequalities, Recall sy: Meyer [1976] in 
iis nit 


ion C.4.5. 
pefinition D.1.1 The function u 


7 is said 
i extended generator, Z,A with Aa 1d to belong i 
t 


syf the domain of 


(i) u € bé 
(ii) v is Borel and U,|v| is boundeg 


U,(pu — (for some P>Oor 
f 

(ii) V= Vp v) (for some P > 0 or for al] p>O). Aa 

Under assumptions (i) and (ii), condition (iii) is equivalent to 


(iv) Vx E Cf =u(X,) 


= UlXay= tt | 
gale Vx € E. (Xo) — fi vs) ds is an (F? 


, P,) martin- 


It follows that the function v is uni 


zero (a set F € & such that U,1r = 0 for some p > 0 or for all p > 0). 
The main importance of the idea of extended generator comes from 
the characterization of the cases when Pe 


of A is an algebra: we shall use 
the following proposition [Meyer, 1976; M 


quely defined up to a set of potential 


okobodzki, 1970; Feyel, 1979; 
Bouleau (1)]. 


| Proposition D.1.2 The following properties are equivalent: 


(i) Z,A is an algebra. 


| (ii) For every law u, for every (7, Pu) square integrable martin- 
| gale M, the measure d(M, M}, on R, is absolutely continuous with 
| respect to dt. 

(iii) C? functions operate on YA: Vd > 1 if F :R > R is C2, and 
| Uy,...,Ug € DA then F(uy,...,ug) € DA. 

(iv) There exists a non-affine function that is the difference of convex 
| functions and operates on SeA. 


We assume from now on that these conditions are satisfied. = ea 
Process is then said to be of Lebesgue type (because of property (11). In 
this case, T(f, g) is defined for f,g € ZeA by 


| 013) (F, 8) = Alfa) — fas — 8AF 


| alculs racket of some 
| the carré du champ operator enables us to calculate the b 


Po~ 
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to Chapter 1 and to Dellacherie and Me 


martingales. We refer nee 
for the stochastic calculus 


yer [og 


Proposition D.1.4 Let fe ZA and g = Af. The ma E 
r BE 


Í 
f Bs — X,) d 
Ct = f(Xr) —fXo) | « s 


has the bracket 


Í 
(cc), = | PA) ds. 
0 


Proof It is easy to see that C f is locally square integrable 
ELCH < +œVt Vx € E. If we apply Ito’s formula to the Semi-mar. 


tingale Y, = f(X,), we get 


PE) =f2(Xo) + Í dst [C C h 
0 


(D.1.5) +2] Y,_dCf. 
J0;,r] 


by Proposition D.1.2. By the fact (Motoo’s theorem) that an absolutely 
continuous additive functional is of the form. |. A(X) ds for some univer- 
sally measurable function h, we have 


«cc =| aoas 
0 


Then, by formula (D.1.5), the process 


PX) - FAX) - 2 | FX )e(X,) ds - Í h(X,) ds 
0 0 


is a local martingale under P;Vx. Hence, by property (iii”) of Definition 


| 
D.1.1, we get Af? = h4 9f It is | 
proved. i afg and thus h = Af? — 2fAf. The resu s 
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| cA e + e 
| w application to Hedging Strategies in Incomplete Markovian 
p? ts 
ke 
Mar 


f representing the evolution of stock values by stochastic pro- 
idea 2 aken on new dimensions with the works of Black and Scholes, 
„esses pa in which the principle of pricing by 
Ce e oll, 


simulation is introduced. 
an rd simulation here is taken in a Specific sense.) 
° o 
e W 


( ressed mathematically by representing the contingent claim 
This 3S oe integral with respect to the underlying stock price. This 

as À one of the complete market assumption of Harrison and Pliska 

jg the objec ertheless, stochastic calculus enables us to Study models in 

1981]. pales cannot be completely simulated. | 

which the ¢ Bouleau and Lamberton [1989], we take a Markovian frame- 
Following * e explicit formulas. We assume that the stock prices are 

work and voue process. The carré du champ operator allows us to 

unctions © i form to the results. i 

an easily i eae is represented by a Feller process (X;) a 
The ae We assume that the discounted underlying stock price S; 

pa eon which we write as 

is a t 


St = G(t, X+), 


9], and 
j leau and Lamberton, 198 
j table hypotheses [Bou SE 
T a Ta š European with discounted value of the fo 
that the con SUN 
exercise time. l esa 

rks i a te strategy for the contingent claim crea 

self-fina 
ane whose discounted value at time / 1s 


V,=Vo | Js dss, 
[0,r] 


io and (J;) is 
sus hedging portfolio an 
initial value of the h k of hedging 
ee i ae mie The residue at time T is the lac 

an (.F ?)-predic 


Rr = H(Sr) - Vr 


i ariance of Rr: 
We have the following estimate of the v 


; ` bs AY 
(RF) 2 E(Vo — Kar ). 20 


1 eo ex ds 
(D21) „elf ren F(G, G) 


0 
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Np 


where T is the carré du champ operator. T can norm 


L 4 
Ey 
the formula 


ally be r 
OMPuyt 

ed} 

y 


I'(F, G) = A(FG) - FAG - GAF, 


‘where A is the generator of (X;) operating on the x y 
[Pr_+H (G(T, .))](x), (Pr) is the transition semi-grou 


In the estimate (D.2.1), equality is obtained for the unj tie 


ariable, a 


nd 
poraa Gx. 


strategy that minimizes the quadratic risk E[R7], and is abe deing 

ting ed by let. 
Vo = E(A(Sr)|-F 9), 

(D.2.2) JP = (TE, G)/T(G, G)\(t, X,), 


It is now natural to define the value of the claim at time 0 
E(A(Sr)|F 0) = F(0, Xo) and at time t by E(A(S7)|.F 0) = F(t,X,) by 


The residue at time ¢ can then be defined by 


R; = Ft, X;) ra V;. 
For the optimal strategy, the martingale (R,) satisfies 


I(F, GY 


V= [ ( ED TG. 


(S, Xs) ds, 


during the optimal management of the portfolio: 


| f T(F = 
23 E su Resa P (rier me] (NAS 
pee ee a) 


which, by Doob’s inequality, gives an estimate for the maximal residue 
where x = Xo is the starting point of the process (X;). 6 

These formulas extend to the case where there are several stocks W! 
with values in Rd): see Bouleau and Lamberton [1989]. 


| 


y 
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Figure 4.1 Recursively simulated Brownian sheet. . i 


Appendix for Chapter 4 


ne 


The Brownian sheet is a Gaussian process X = (Xs) which can be defined 
as follows. 


Let (yn) be an C.O.N:S. of L({O, 11), and let (gn) be an iid. sequence 
of standard Gaussian random variables. For A € (0, 112) the formula 


OCA) = > gnll Xn) 
n=0 


defines a COSM (cf IIB.1) (Centred Orthogonal Stochastic Measure). 
Then Xs, = i = f JoXa B)dadp,X is 
A E Q([0, s] x [0, ¢]). Putting Wrls,t) = JoJo Xn» : 
also defined by the series 
X= N 8N 
n=0 
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which can be shown to converge in @ ([0, 11°) a.s. Such a Proce 
recursively simulated by an extension of the method expose à: AT 
The idea is to use the conditional law of Xs+h,t+h given C3 


X st) X sth > X s+2h,t» Xs,t+h > X 542h,t+h ? X s,t+2h > X s+ht+2h > Messy. ; 
; +2h 


which can be obtained by a symbolic computer system sing the f 
the vector "Ct thay 


X sto X s+h,ts X s+2h,t> X s,t+h , X s+h,t+h> X s+2h,t+h > X s,t+2h, X s+h,t+2h : Krah 
| 142} 


has the same law as the vector 


VStN1, 

SIN; + VAN», 

SIN, + VIAN: + VthN3, 

/stN, + VAN, 

V/stN, + VthN: + V'shNa + LNs, 

soi cet pe Alle ic 

SIN, + shNy + ShN), 

V/stN, ee ee ea 

SIN, + VIhN + VAN + V'SANa + ANS + hN + V SAN; + hN; + hN 


where (Ni, N2, N3, Na, Ns, Ne, N7, Ng, No) are independent standard 
Gaussian random variables. 


Ss s+h s+2h 


gnome 


Stochastic Differential Equations and 
Brownian Functionals 


In this chapter we investigate the numerical resolution of stochastic differ- 


f 
ential equations and the computation of expectations of random variables | 
defined on Wiener space. | 


A LIPSCHITZIAN STOCHASTIC DIFFERENTIAL 
EQUATIONS: ITO’S THEOREM 


A.1 Notation and Lemmas 


Let (Q,.F;, P) be a probability space with filtration (7). We will assume 
that martingales are continuous on the right with left limits. Let M be an 
(.F;)-martingale. 


A.1.1 Doob Inequality 
Let p, q be such that 1 < p < +%, and 1/p + 1/q = 1. For all t € R,, 


|| sup Mslllp < al/Mellp. 


SSt 


A.1.2 Burkholder Inequality 
If 1 <p < +, then Yt € Ry 


— || sup [Mslllp < 4PM, MY Ilp- 


s<t 


A13 Gronwall Lemma 


u,v:R, = R, be continuous functions and let c 2 0. If 
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Lg 


v(t) Sct Í v(s)u(s) ds VI<T 
0 


then 
t 
v(t) < cexp Í u(s) ds VisT. 
0 


A.2 Lipschitz Hypotheses 
A function f from R+ x Rf to R” satisfies the Lipschitz hypotheses if it 


is Borel and 


VT>0 dCr>0 Vx,yeRŸ Vte [0,7] 


[f(t,x)| £ Crd + |x) 


and 


FG 0=fGMIS Crey: 


A.3 Ito’s Theorem 
Leto: R, x R” > R™ and b : R, x R” > R” satisfy the Lipschitz 


hypotheses. We consider the stochastic differential equation 


{ dX, = 0(t,X1) dB, + b(t,X;) dt 


_(A.3.1) + 


where (B,) is a standard (.7;) — P-Brownian motion. 
Let (U,) and (V,) be progressively measurable processes, and put (when 


this makes sense) 
Í t 
(ZU) =x + | ols, U;) dB, | b(s, U;) ds 
0 0 


and A,(U,V) = E[sup,<, |Us — Vs|?] for p > 2. , 
We begin by looking at hypotheses under which ZU is defined. Usiné 
la +b +c|P <3?-"(\al? + |b]? + |c|P), we have 


y 


ese 
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p 
A 


| otr, Ur) 4B, 
0 


| b(r, U,)dr e|) 
0 


4, by the Burkholder and Cauchy—Schwarz inequalities, 
1 prape f lo(r,U,)’ d j ! d 
F F, r 
age f r) +E ( | e, U,)| ar) ) 


| LP + (pyre PPT | Bloc. U,)Pdr+ P f E|b(r,U,)P ar) 
0 


glsup (FU)s? $3" (1 +E | sup 
s<t s<t 


+ sup 
S<t 


a 


t 
ap ( |x|? + ((4py tP- + PCr | E(1 + sup |U,1Ÿ as) 
0 rss 


Therefore, if U is an adapted continuous process such that Vt 2 


0, E[sup,<r |U ,|?] < œ, we deduce that JU is well defined and, by Gron- 


wall’s lemma, 
FSU |QZU)P1< C(p,T) 


for a finite constant C(p, T). 


We now consider two continuous adapted processes U and V such that 


Efsup |Vs\?1 < °. 


s<t 


Yt e R,, E[sup |U sP] < ce and 
s<t 


The same chain of inequalities gives 


S p 
f (o(r, Ur) - oC, Vr) iB, 


0 


set 


S P 
| (b(r, Ur) — be Voyar| | 


0 


A,( ZU, ZV) <PTE [sup 


+ SUP 
s<t 


and, by the Burkholder and Cauchy-Schwarz inequalities, 
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<2P-!4P y PE [fie U,) — ocr, ar" | 
+2P-1R (f Ur) = Bir; VE ar 
<2P-l4# PE {1727 [loc U, — o(r,V,)|? arl 
-ig {er f w, U,) — b(r, V,)|P arl ; 
Hence we have 
(A.3.2) A, ZU, ZV) < K(p, nf A;(U, V)ds. 


Thus, putting X® = x, X") = ZX, we obtain by induction, for ¢ < 


ACT. aa) < Ar(XO XY) K(p, rye 
? 5 n! . 


Therefore 
sup |x” = xe) erie 
S<t p 


n=] 


and 2i SUPs<z Ix = A) < +00 a.s. 
It follows that X™ converges uniformly on compact sets a.s. and in L? 
to a continuous process X. This implies that ZX converges in the same 


sense to ZX, and so 


X= JX. 


, , nig i ] 
The uniqueness of the solution comes from Gronwall’s lemma, since if U 


and V are solutions that are progressively measurable and 


isup |Vs|"] < %, 


Sst 


Vr > 0, {sup |U s|] < oo and 
SS 


we have, by formula (A.3.2), that 


A 
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L 


A (U,V) < C(p,T) | A.(U,V) ar. 
0 


Ar (U,V) = O and U and V are indistinguishable. 


fore j 
pet : general uniqueness result can be proved using a stopping argu- 
ge Revuz and Yor [1991] and Protter [1990]. eer 
ment | 
pheoren A.3.3 Under the Lipschitz hypotheses on o and b, equa- 


(A 3.1) possess a unique solution X = (X,). The Picard iteration X% 
jor ` 


? p. 
nverges a.s. and in LP: 
co 


sup |X@—-Xs| 0 as. an in P pe [l,o], 


st 


44 Two Properties of the Solution 


Proposition A.4.1 For 1 <p<+c,andt<T, 
IX < CP, DA + |x)). 


Proof We may assume that p = 2q, q > 1. For simplicity of the nota- 
tion we suppose that the dimension is one. Applying Ito’s formula to Le 


gives 


XIE =x + f 24|X |27 (os, Xs)dBs + b(s, Xs) ds) 
0 = 
+ ; | 2404 SDRC 6G, x)" as: 
We know that 
E i IX |" ds STE sup IX |41) < +o. 
0 s$ 
Hence taking the expectation erases the stochastic integral and gives 


: 2¢ "i 
Bix pt <% + CG D| 10 + EXT 


t 
0 


<(1 + Ci) + cn] EUX PO ds, 


à 


274 STOCHASTIC DIFFERENTIAL EQUATIONS AND BROWNIAN 
and the result follows from Gronwall’s lemma. 


: | 
a 
1 


Proposition A.4.2 For 1 <p<+., 0Ss,t<T 


|X; — Xsllp £ cp, DA + xD - s)". 


Proof 


t P 
E -x ps2 |E Í o(r,X,)dB,| +E 


t 
Joe,xar 
S 


] 
t 

<2P-!4Pp P(t — gje] Í Elļo(r,X,)|? dr 
S 


Í 
$ OPK pig)? D Í Elb(r, X|? dr 


<C(p, TN - PPC + 1x1)? o 


mm 


B DISCRETIZATION OF SDEs | 


B.1 Euler Scheme: Convergence and Rate Estimates 


Our aim is to solve (A.3.1) by a more effective method than Picard’s 
iteration. 

For / e N’, let tk = k(T/D), 0 < k < l be a set of points partitioning | 
[0, T]. We define bee on Je, tk] by 


t t 
(B.1.1) XP aX) + | (4, XP) dB, + + | b(t, XV) ds 
tk tk 


with Xo O | 

It is ie to compute explicit expressions for the yee of x at the | 
points of the partition. By induction on k we have x? e LP,1<p<+, | 
and 


1-1 


t 
xP eaf Dinky tt) dBs 


+f Soin x! ir, ut) as. 


pd 
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yemma B.1.2 Forl<p< +00, 


ae : |€ L. 


jeed, if Yr = SuPser IX$?|, then by the B 


. . urkh 
inequalities and the Lipschitz hypotheses older and Cauchy-Schwarz 


? 


Ey? < C | |x|? + (P V ee) | E(P) as 
0 : 
The result follows by Gronwall’s lemma. 
We compute the difference X® — x Pr 
t lı 
l m 
E | 0 S ot, x9) - G(x, Xs)) la t(s) dB 
j=0 
t l-i 
+] LHX? = BO, Xs) pyptjert(s) ds. 
J= 


We now assume the Holder property: 


(B.1.3) 3a > Olo(t, x) — ols, x)| + |b(t, x) = b(s,x)| < Cr + [x|)[t- |. 
Setting Z, = sup,<, |X — X,|, we can write 


p 
l-1 


t 
EZ? <C(p,T) part | SEXP) — OSX Dlia ds 
0| 47 

p 


p | 1-1 
HE] D GX, = ps, X irtja 49 
0 j=0 


For 
S ; 
j (tjs tiil, 
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lot XO) ols, X)| 
s lott; Xip) = ot, X) +106, Xy) — ols, X,,)| 
+ lo(s, Xy — 1%) 
<CIZ, + (1 + 1X, D6- tj) + [Xs - X41] 
and so 
I 
— + — 


Blot), Xy) - 60, Xs)? SCP, T,x) | EZ; + Tap ah 


(because (1 + |X;|”?) < COP, T,x) and E(|Xs — X4;|P) < C(s - tPA) 


Similarly, we have 


1 
ElbG;, XO) — b(s,X,)|P < C(p,T,x) ez; + a ma 


t 


and thus 
~~ t 
aP (p/2)-1 {FP es ib ae 
EZ Cpt. If EZ; ds + Top * A i 


So by Gronwall’s lemma, 
EZ < C(p rT) with B= NO 
es >“ [Bp 2 : 


that 1s 

C 
sup IX — xX, |||, < — 
SUB he rlllo B 


Thus we obtain the following theorem: 


Theorem B.1.4 Under the Lipschitz hypotheses and (B.1.3), the p" 
cess (X; ) converges a.s. and in L’ uniformly on compact sets to the solu 


tion of the SDE. 


n1 x Í, 
It remains only to prove a.s. convergence. Choosing p such that pp? 


we have 
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4, su XO a 

= | : XP S 


C 
[Bp ? 


pich Shows wat 
y 


(1) 
Zip xp 


ntegrable, and hence 
j 


D 
pe XP-X|>0 as. 


g.1.5 Remark 
The approximating process X™® = (x) 
the points tk = kT/n of the partition. 

If we ask for an approximation of the diffusion over the entire interval 
[0,7], it is simplest to take a linear interpolation between the points 


Xh -This linearly interpolated scheme has almost the same rate of con- 
vergence as the preceding one: 


As previously, we define for t = kT/n,k =0,...,n—1, 


O<r<T is easy to compute only at 


fork =0O,...,n-—1 


| Xi I roA Cr B HG a 
and X 0° = x 


and for t € [fg,txxl, k = 0,...,n— 1, put 


t— fk (n) (n) 
(A ini E Xi, ). 
fk+1 — tk 


Then, under the Lipschitz hypotheses and assuming that (B.1.3) is satisfied 
With œ > L 


PEA 1 +log n >] 
| sup ix, = X™ IIlp <C m fs ds 
te [0,7 | 


‘onstants of the Lip- 
Siete C is a constant depending on p,T and the constants of the Lip 


nn. 
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schitz hypotheses and of (B.1.3). We refer to O. Faure [1992 Ch 
for a proof and extensions of the result. ” "ter IV] 


B.2 Example: Wiener Iterated Integrals 


Let B, = (B},..., B%) be a d-dimensional Brownian motion. Consig 
er 


SN? 
O<Ss1 <+ <SN <t 


where the indices i; are chosen, with eventual repetition, from {1 


Let 0<t, <---< t<- be a partition of R, and let 


| 

l 

| 

| 

1 

i 

| 

In(t) = dB} Fa - dB 
rad) | 


t | | 
rO STOLL: | 
0 k | 
Í 
Paz | D Dienn) dB? 
vi 
Í 
IV) = Í S Piga an. 
ae 


By induction, we get | 


G2 PGs >. Ba r 62.382) 


~ Oy 4l lj2+1 Tin 
ji <j2<--<jN <k 
iN IN 


| 
{ 
| 
| 
i 
Proposition B.2.2 IN (t) converges to /y(t) uniformly on compacts | 
sets almost surely and in L? as T = sup;(tk+ı — tk) tends to zero. | 


Proof If we set X] = B, X$! = [ X*dBs‘ the process Xı e 
(X!,...,X!) is a diffusion process solution of a Lipschitz SDE driven / 
the multidimensional Brownian motion B. The above discretization 1S that 
of Euler scheme. Putting t = sup;(tk+ı — tk), we get the estimate 


mi mn sg 
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sup |Z a 
| ot nS) = Olly < C(p,N T)V/% 
> 3 t 


gxample of a Multistep Scheme 
a dd d-order Langevin equation 
X( = YO 
(t) = -YOX C) + b(t, X(t) + olt X(t))B(t) 


a(t) is a White noise 
, describes the motion of a particl 
icle in a noi 
se- 


where P ( f feld with fri 
rbed force Te with friction. It can be interpreted 
ed as the vector 


A chastic equation 

aX;= Vidt 
() fav = ee ee dt + o(t, X(t)) dB; 
Let 


B(t) = exp Í ys) ds, 
o 2 


ose a partition of 10, T] into / intervals Jtk, fkn]. We assume that 


and ch 
and define 


yis differentiable 


x? -gorf + | r% x ry) t 


t S l-1 1 
+ | (J >. BC) | 7 (2y (tj) + PDX 
j=0 


0 0; 
+ px) ipit ds 


1 
: f (] > papolt XP ua a) ij 


o\ 40 0 
Theorem B.3.1 Suppose that © and b satisfy the Lipschitz com 
and the Hôlder condition (B.1.3)- Suppose also that there exis 


Such that 


~ 
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Q 
IO- YO s Kli- sl". 


Then for any p 2 2, 


c 
I| sup |X? — Xslllp < Tni: 
ssf 
Proof (a) Suppose first that y= 0. Then 
ay are 
XO =x+vt+ | Í Yot XP uaar Jas 
o\ 4047 


Í 
+ [ (| Moe, XP tt: a Ja 
0^ 
J=0 


0 
Í 
x=x+vt+ | ( 
0 


By the stochastic Fubini theorem, if s < t, 


S S 
1122 ar+ | SU X dB, ) ds. 
0 0 
Í 
X,—-X;,= v(t —-S) + | {@—svr)[b(r,X,) dr +6(r,X,)dB,] 
0 
and as in Proposition A.4.2, 


BIX, xs | ble = sP 


+B] 


+E If (t—svr)b(r,X,)dr 
0 


SC(p,T,x, v)|t — s}. 


f 
J 


t 
Í (t-svr)olr,X,)dB, 
0 


It also follows from the stochastic Fubini theorem that 
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f l-1 
z Pay m4) 
y” — Xi f Event ) 3 BE, X ir) dr 


j=0 


1-1 
40) 
+> (oly, Xt ) = O(r, XM) | 


ul 
i f Zt = sup Ix $ Kk 
St 


Then 


t 
p t E|. 
E[Z;] < EAE vif E(Z?) ds + T + z] 3 


by the Gronwall lemma, 
C 
PAIE < Jan 
(b) Suppose finally that y satisfies the condition of the theorem, and let 


(X, V) be the solution of (*). We put 


Y, = BOX: 
W,= SY OROX: + BOV.. 


Then we have 


dy, = W,dt 
dW; =; vO +y°(t))Y dt+ BODDE, X) dt + o(t,X:) dBi] 
4 


icti ation (*) with b and © 
and so (Y,,W,) is the solution of a frictionless equation ( ) 


replaced by 


TE ay-ly 
b(t, x) = 2Y (0 +y? (N + Bb, BO) x) 
9 aad 4 i 


zi x) = Bot, BOV 2) 


> 
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It is easily seen that b and © satisfy the Lipschitz conditi 
the Hölder condition with œ ^ 1, and we have the approximati 
ON re 


from (a): Sult 
(D) 
I ee - Ysllp < Jani: 


Since B is a positive continuous function and 


yo = = B(HX, 


the proof of the theorem is complete. O 


B.3.2 Comment 
As is usual when dealing with second-order differential equations our 


approximation scheme is multistep. In fact, with h = T/I, we have: 


yo =% 
(J) D -k 8 
BCX — Xo =v + bo +00 (ti — s) dB, 
0 


7 _ fr 
Bete XO, + B-XL,- 2BGIX = bi- + bi) a 


i+] 
ti 
+5 | (G-11)d8B, 
i+] 


+ Gi (fi — 5) Bs 
li 


where 


Bi = à OYOD + Ue) BUEDX.? + BE BLE XD) 


5; = BtoC, XP). 


In the one-dimensional case, the random variables 


tj li 
Í (ti -s)dB, and | (s — ti-1)dBs 
t 


ti] 
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ssian ariance h? 


wt Oily be simulated. /3 and covariance h?/6, so they 


po Not Use the Shift Method Along the S 
| hat, by a discretizatio ample Paths 
suppos? that, i n method similar to those in th; | 
» were able to replace the problem of com in this section, 


‘an functional by that of computing th puting the expectation of a 


ni 
prow expectation of a functi 
of the form nctional 


F = f(B;,,B;,,. -.,B;,,...,BT), 


where 0 < tı <... < In- < T partitions [0,T]. 

The functional F is finite-dimensional, and the random variables 
Boln Bn» -- -BT — Bi, are independent Gaussian with known vari- 
ance. It is therefore easy to simulate F on ([0, 1]",.A (0, 1}”), dx") 

if n is large, as explained in Chapter 2, Section C.5, quasi-Monte Carlo 
methods are no longer applicable and the only available methods are the 
classical Monte Carlo and shift methods. 

There is no difficulty with the Monte Carlo method but, as we have 
mentioned, we must draw all the variables B;,,i = 1,...,n independently 
at each step. 

Concerning the shift method, we have seen in Chapter 3, Section A.7 
that it is useful not only in infinite-dimensional but also in finite-dimen- 
sional cases. But its application can be carried out in several ways (cf. 
Chapter 3, Section A.7) yielding different rates of convergence. 

One of these ways is particularly bad: we suppose for simplicity that 
tix) - ti = T/nVi, and consider the approach that involves changing the 
iid. sequence 


B;,, Bry g Br, a ss Biazi E Bin» Br g Bini 


into the sequence 


T 
B;, — Ba, Br A B;, ee BT Breit 8» 


ependent of the preceding vari- 
d path is very close to the pre- 
—t;=1/n tends to zero 


y l . . 

i g is a standard Gaussian variable ind 

ae In such a method, the new discretize 
Mg one at every step, and, as the increment fi+1 


> 
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the path remains just as close! Practically, it can be tested b 
that the rate worsens as n increases [Ben Alaya, 1992], y “XPerimen, 
To overcome this difficulty, the successive points of Wiener 
by the approximate Brownian paths yielded by the shift must b oo given 
from each other in the Wiener space. It is possible, for exampl i far away 
late the Brownian path recursively (cf. Chapter 3, Section C3) a Hy simu. 
when the random function is called: each recursively Salaa tO shif 
mated Brownian path is obtained from a random function Shifted 7 Pproxi, 
It is also possible to expand the Brownian motion in series e Step, 
Haar functions or with other bases of L7[0, T]. Here again, there are 2 
ways to implement this, depending on the chosen method of indexing 


B.4.1 Remark 
(1) As we explained in Sections D.5 and F4, the accuracy of the di 
S- 


cretization we get from expanding the Brownian motion in a series ma 
depend on the sample path (cf. also 0. Faure and J. Gaines [1992]). In re 
case the shift method uses an infinite-dimensional product space [0, 1 T as 
explained in Chapter 3, Section A.5. | 

(2) What we have said above about Brownian motion can also be said 
about processes with independent increments and Markov processes: do 
not apply the shift method on an iterative simulation because the procedure 


worsens as the discretization becomes sharper. 


C IRREGULARITY OF BROWNIAN FUNCTIONALS 


In order to discuss the value of the preceding approximations, and espe- 
cially of the almost sure approximation, we must first investigate the reg- 


ularity of typical Brownian functionals. 
Let B, be a standard real Brownian motion defined on the Wiener 


space Q = o((0, 1], R) equipped with the uniform topology and the 
Wiener measure P. (The subscript 9 in Z0o([0, 1]) means that elements of 


Co([0, 1], R) vanish at zero.) 


C.1 Lack of Continuity of Stochastic Integrals 
Proposition C.1.1 Let f € L°([0, 1]). If the random variable F Se 
j f(s)dB, possesses a continuous version, then there exists a bounde 
measure u such that u({0}) = 0 and f(s) = Ws, 1]) Lebesgue a.s. 


For the proof we need the following lemma. 
(0, l), R), 


Lemma C.1.2 For any nonempty open subset G of ol 
P(G) > 0. 
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of the uppose fir 


0 f ; St that 
Pr grownian motion on R, and for q > B 


is a stan , 
| dard one-dimen. 
jon? 


SO 8/24, and 0e ve 
d T = inf {s20 : [B,|>a) 


2 
A) Š 


ss itis ihe real part of a complex continuous martingale, Z is a real 
vox ous martingale. It follows from the optional sampling bedem that 


ElZr\e] = E[Zo] = 1. 


we remark that 
cosaBr,; 2 cos œa > 0 


Letting t > ® WE obtain 


| 02 
E [o 5 r| = (cos oa). 


Letting & —> 1/2a we observe that T cannot be bounded a.s., and hence 
for any t 2 0, P(T >t) > 0. | 

(b) We now return to the Wiener space Q = ALO, 1], R). From (a) we 
deduce that for any open ball B(0, a) in Q we have P(B(0, a)) > 0. Now 
let h € L’([0, 1]) and put 


| t 
| «D | h(s)ds, OStS1. 
0 


: : at the contin- 
It follows from the Cameron-Martin-Girsanov Laue seers 
uous process (B, — g(t),0 < t S 1) is a Brownian mo 


he new probability measure 


| | l 
| : l 1| h yeas} Ae. ®. 
| P(A) = E | 14 exp rf i h(s) dBs - aK : 


| As p te , , 
P and P are equivalent, and 


286 STOCHASTIC DIFFERENTIAL EQUATIONS AND BROWNIAN FU à 
Nery 
ONAL, 


P(B(g, a)) = P(BO, a)) > 0, 


we deduce that P(B(g, a)) > 0. Since the Cameron—Martin xs 


l : Pace ; 
in Q, the proof is complete. 'S dense 


Proof of the Proposition: Let F be a continuous function on 
that “i anch 


1 
= Í f(s) dB, as. 
0 


The function À : @o([0, 1], R) — R is such that if (B!) and (B 


2\. 
. : . r) are tw 
independent Brownian motions and œ belongs to [-1, 1], then : 


F(oB! + V1 — 02B’) = aF (B') + V1- 02F(B’) a.s. 


Both sides are continuous maps from @0((0, 1], R) x #([0, 1], R) to R 
and are, therefore, everywhere equal. Thus 


Fox + V1 — o2y) = oF (x) + V1 - o2F (y) 


for all x,y € €o([0, 1], R) and œ € [-1,1]. Taking y = 0 gives F(0) = 0 
and F(ax) = aF (x) for a € [-1, 1], and hence for all a € R. So 


F(ax.t by) = aF(x)+bF() Vx,ye Go({0,1],R) Va,be R. 
Thus F is linear and continuous and there exists a bounded measure p 


such that 1({0}) = 0 and Ê (œ) = L(@)V@ € Q. Integrating by parts, the 
equality [(@) = i w(s)u (ds) gives 


uco) = | H(s, 11) dB, as. 
0 


It follows that f(s) = u(]s, 1]) for Lebesgue almost every s. o 


Corollary C.1.3 Let B' and B° be independent Brownian e 
and let G be a Borel version of (2! dB?. Then the set {an GO 
is continuous} is negligible. 


RP a wah 


dnsiimasiissaisetr einen 
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oe foll f i 
This follows from the proposition si 
ite variation} is negligible. n since (œ; : s —> @i(s) has 
pou" 
No n-Riemann-Integrable Stochastic Integrals 
ill show the irregularity of the iterated integral 


1 1 
F(@) = I, B;dB (- | 0); (s) doo) 


ed on the two-dimensional Wiener space (Q, F, m) = 

sta) with Q; = E((0 1, R) i = ds à 

This integral can be considered as a generic example of functionals of 
diffusion processes solutions of an Ito’s type SDE. Indeed, on the one 
hand, as We shall see later (Section E.4), these diffusion processes can be 
written as the sum of a series of iterated integrals, and on the other hand 
the approximate continuity at smooth points of the Ito map, which defines 
diffusion process (Stroock—Varadhan theorem, cf. Ikeda and Watanabe 
(1981, Theorem 8.2, p. 432]) can be proved from that of F alone. 


Proposition C.2.1 Let Q be endowed with the uniform topology. For 
every nonempty open set G and every a € R, 


P{oe G:F(œ)>a}>0 
and 
P{@e G : F(@) <a} > 0. 


Proof We introduce the process (Lévy’s area) 


S(t) = 5 [fo do(s) — [ (s) do) 


and the radial process 


r(t)=~V (œ (s)? + (œs). 


It 
follows from Ito’s formula that 
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sr) = Í r(s) dE(s) +t, 
0 


where 


= do(s). 


al) t @(s) 
o= f 6) dons + | © 


The bivariate martingale (S(t), §(t)) satisfies: 


(S,S) = 4 | r(s) ds 
0 


(S, 37 =0 
(a 37 =. 


Thus, by Knight’s theorem on the representation of continuous martingales 
as time changed Brownian motion, if K(¢) is the inverse of the increasing 


process t > 4 f r(s)? ds, then 
(nr, Ed = (S(K()), 6) 


is a bivariate Brownian motion. In particular, the process (S(K(t))):20 1 


independent of the process (E(t)):>0- 
Since r? is the solution to a stochastic differential equation, where O 


is Hôlder continuous with exponent 1/2 and b is constant, we observe 
[Yamada and Watanabe, 1971] that r is adapted to the P-completed fil- 
tration generated by the driving Brownian motion E. Therefore (S(K(1)))iz0 


is also independent of r. 
Lemma C.2.2 Let 


Ít t 
hit) = (hy (t), h2(t)) = (| TAES as, | u(s) as) 
0 0 


with ui, uz € L?([0, 1]). Then 


Ya e R,V8>0, P{F(@ +h) > a, [lol < 5} > 0. 


Y 
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pk 

| of the Lemma We have 
proof 


l 
= | @(s)dw 
F(& +h) f IMS + | 


l 


i 
M6) do(s) + | 1 (s) dhy(s) 
0 


l 
“| hı (s) dh;(s). 
0 
we remark that 
l 
| 
| | W1 (s) do (s) = $04 (1)o9(1) +S 
| 0 
| m ! 1 2 
201(L)@(1) > -172(1). 
Moreover, 
1 1 
| hy(s) da(s) = hy (D@(D) - | con(s)un(s) ds 
0 0 | P 
> — AMIA) 1) - (f poas) 
j 1/2 
. (| u‘ (s) ds) 
0 
| 1/2 f x jas) 1/2 
' 2 u5(s) as 
| &,(s) dh2(s) 2 — (f rs) as) | 0 
0 
50 that 
F(w +h) 2 S(1) + HU), 
Where 
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H(r)= - tr) - A) (21) v 1) 


1 1/2 
= (| r*(s) ds) QT 
0 


J 
+ llull) + Í hds 
0 


Hence 


| J 
P[F(@ +h) >a, |||]. < 8] > P È | f r*(s) as) + H(r) >a: loll. < 5 
0 bi 


and this last probability is strictly positive, because N is independent of 
and the function de 


f(a, B) = Pina) - B > a} 


is > 0 for œ > 0 and Be R. So, this probability can be written as 


E | enea if ( if r(s} ds, À #0)] 


E 


and the result follows from the fact that P{supo<;<; r(s) < 5} > 0. 


Proof of Proposition C.2.1 (end) Let G be an open set in Q. The set 
of Cameron—Martin functions H = {h : h(t) = fo u(s) ds,u € L*({0, 1], R°)] 
is dense in Q. Let h e H, h in G, and let P, be defined by 


E 
Pa = exp] Í uy (s) den,(s) + [ POY CE sf laa] -P 


0 


(w(t) — A(t))>9 is a standard Brownian motion under P}. An applicati? 
of Lemma C.2.2 gives the result. 
bounded con 


In particular, by the proposition, if @ is any nonconstant do i is 


tinuous function from R to R, the random variable 9 i; OF 
not Riemann-integrable. 
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«4,3 Remark irf 
C2 area SCO = ziho %0) dos) f osa 
0 ®1(s)] can be simulated (in 
srobability space: Te (a > O gO ous random variable on 


à t J20 1S a Standard Brownian 
the same law as 


e SÊ 
motion 
(1) 


1/4 IBP 2488 


À 
(C+ ) s )*Jds 


ihe property quoted after Lemma C.2.2. Furthermore, we can see by 


computing the characteristic function that S(t) has the same law as the 
‘andom variable 


t 
1 
C25) By’ i (eer? + go as 


if (Br) is a standard Brownian motion, the law of the random variable 
A B° ds is known. This was historically one of the first applications of 
he formulas for absolutely continuous change of probability measures on 
Wiener space [Cameron and Martin, 1945]. 

We have 


t t 
E exp l- Atan [A(s — t)]Bs dB; — f X tan? [A(s — t)1B? as} = 1. 
0 0 


However, by Ito’s formula 


t 
~ a| tan [A(s — t)]Bs dBs 


0 
t 
Š W | O Bs PO | tan [A(s — t)|ds 
~ 2 J, cos? [Ms - 9] 2 Jo 
r2 


2 | ` x 
= | (1+ tan? [AS - DIB; ds + 5 log | cos (M)|. 


0 


Thus, substituting, we get 


~y 
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and so, by analytic continuation, 


eee 
Ej exp (| == | B?ds)| = ——___ 
2 Jo Vcosh At 


Returning to (C.2.4) and (C.2.5), we get 


2 t 
ius u“ 1 
Ees” — E| exp | = J tory + (BY)] as) 
1 


ut 
cosh — 
2 


and so S(t) has a density given by 


1 
t cosh Tx/t` 


Un autre problème, dont j’ai commencé l’étude en 1939, et dont j’ai donné la 

solution complete dans ma communication au Second Berkeley Symposium on 

Probability and Statistics (1950), est relatif a laire S(t) comprise entre l’arc 

A(Q) A(t) et sa corde, c’est a dire puisque A(0) est en O, Paire balayée par le 

vecteur OA quand le point A décrit cet arc. 

Il est naturel d’inscrire dans l’arc A(O)A(t) une ligne polygonale AgA,...A, 

a côtés tous très petits, et de remplacer l’arc par cette ligne pour avoir une 
approximation de S(t). On introduit ainsi des sommes de termes de la forme 
sRyEy /dty, Ry désignant la distance AgAy et Ey/dty désignant la hauteur du 
triangle OAy_ Ay issue du sommet Ay-_1; &v est une variable gaussienne réduite, 
indépendante de Ry. Une telle somme est alors de la forme EVER, & étant tou- 
jours une variable gaussienne réduite. A la limite, on obtient pour S(t) la forme 


t t 
UE, où atid ts PX W)duJ = YY du, 
0 0 


I,J et U étant indépendants (donc aussi U et 6). 

/ , . ¢ 7 . NS Pas à wea tj 5 une 
Ce résultat est indiqué dans mon mémoire de 1939, ou J al aussi forme a 
r P Pa Gee: á ar T n ° oe res ol 
équation aux dérivées partielles vérifiée par la fonction de répartition CON] 

y ‘ P 9 , n # , Ps \ te ue 

de R(t) et S(t). Mais je ne l’avais pas intégrée, et ce n’est que plus tard que J 
pu, et même de deux manières différentes, déterminer celle de R(t). 
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geo” 
‘sre utilise la f.c. 
La pe en déduire Sa ae , obtenue En 1944 par C i 
est Í 2). C’est une loi qui é Qui est P(z) ameron et Martin. I] 
| cosh (tz/2)- © 101 qui était déja connu > €t celle de S(t) = UE ai 
€ > 1 est 


f densité de probabilité est f(x) 
S 


c3 Other Topologies on Wiener Space 


ess that the dj Te 

2 might eu ISCONtinuity of | 
oS) dw?(s) comes from the fact that uniform ra ee F(@) = 
‘s oo weak and has to be reinforced. potogy on Wiener space 

he followin 

Nevertheless t 8 result can be show | 
gid dt beas TS me proof of proposition C.2.1, as En pee 
uch that H is dense in B, B c Q, P(B) = 1. e a Banach space 


; = Then there is n j 
of F (ie. F = F P-a.e.) such that F|g is continuous. o version F 


We refer to seas s article for a characterization of Wei multiple 
integrals G for which such a Banach space B exists on which G is an 
tinuous. | 


c.4 Importance of LP Estimates 


Let us consider a SDE of Ito’s type, given by equation (A.3.1) with regular 
coefficients. It is known [McShane, 1974; Doss, 1977; Sussmann, 1978] 


Another problem, which I began working on in 1939, and gave the complete solution to in 
my communication to the Second Berkely Symposium on Probability and Statistics (1950), 
is concerned with the area S(t) bounded by the arc A(0) A(t) and its chord; in other words, 
since A(0) is at O, the area swept out by the line OA as the point A describes this arc. 

It is natural to inscribe in the arc A(0) A(t) a polygonal line Ap A1 -.- An, with very small 
sides, and to replace the arc with this line so as to get an approximation of S(t). We also 
introduce sums of the form 5 RySyVdty, where Ry denotes the distance ApAy from the 
comer Ay_,;&, is a reduced Gaussian variable, independent of Ry. Such a sum is then of 
the form EV ER2?, where € is still a reduced Gaussian variable. In the limit, we obtain for 
S(t) the form 


t t 
UE, where U?=1+J,1= SY XW) dud = a du, 
0 


where I,J and U are independent (and so U and & are 8 wel) Tha sur PA 2 z 

1939 treatise, where I also give an equation involving partial TAAT 

Joint distribution function of R(t) and S(O. But jdid ne Me ae eal function 

wit two different ways, to determine that of RU): a are Le easy to deduce that of 
2) of J, obtained in 1944 by Cameron and Martin. ane This was already known. It is 
> Which is @2(z), and that of S(), which is 1/cosh (12/ JD). 

Naturally ac. and its probability density is f(x) = 1 /(t cosh (7X7. 


~ 
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that if the Frobenius commutativity condition 


Vre R” lowu = "oww uve Rd 


is satisfied, then a version of the solution can be chosen such th 
to 


mapping 
(x, ©) > X*(@) e ZO, 1], R”) 

is locally Lipschitz. In particular, this occurs when d = 1. In thi 
whenever @ is a bounded continuous function from ELO, 1], R”) a 
random variable @(X*) is Riemann-integrable and its expectation ee 1 
computed by simulation with an almost-sure approximation scheme 
that of Theorem B.1.4. There is a large literature on this subject; see Fe : 
cially Mc Shane [1974], Clark [1980], Pardoux and Talay [1985], Mine. 


[1974], Sussman [1978] and O. Faure [1990]. 
Now, as we have seen for a particular case in Sections C.1, C2 and 


C.3, if the commutativity condition fails, then functionals like @(X") are 
no longer Riemann-integrable and their expectations cannot be computed 


by direct simulation. 
The general method is therefore to use Z?-approximation results to find 


a continuous random variable Fe such that 


IEF. - Ee(X*)| < 5, 


and Fe can generally be defined on a finite-dimensional space. We can 
then compute EF, by simulation with an accuracy of €/2, which gives 
the desired result up to €. The computation of EF, can be performed by 
classical Monte Carlo, by quasi-Monte Carlo, or by an ergodic transform 


as is used in the shift method. 


C.5 Sequential Integration by Picking Smooth Functions in the 


Wiener Space 

Under the expression “sequential integration” we group together all F 
numerical methods of computation of the integral of a function f as the 
limit of average of its values at a sequence of points (Ga): 


N 


ae | 
fr du lim à - FGn)- 


n= 


cou 
nite-dimensional case ( f : [O, 1}5 295 


ine Ë >R sent 

ele a ao f to which here ru methods are 
perl sate y nasi Monte Ga can be classified in ra | as well as for 
Monte a 993] (low-discrepancy sequ main families: 
pouleau» 19 “i : q 4 ences), and shift 
Pg already discussed, in the infinite-dime f | 
on 1° eae WAP) = (EO, 11, R), BE, Hen 
which will be he main framework, the extension es » 1], R"), P) 
. „mediately r set difficulties: ese methods 

A lot of interesting random variables bss 
i als, solutions of SDE’s, etc.) are EE E eas (stochastic 
so, if (Én) is a sequence of points in the Wiener space, the site equality. 
(En) ÍS not defined for such f’s. » the pointwise value 


fr 


(b) These random variables are very irregular and their Borel ver 
sions are not P-Riemann integrable. For example, for the bidimensional 
prownian motion @ = (œ!, œ?) the random variable f(@) = f ol do is 
such that for every nonempty open set G and for every a € R a 


nsional ca 


PIGN {f >a}]>0 and PIGN {f <a}]>0, 


and for every Borel version F of F, the set of continuity points of F is 
empty. 

(c) Even when f is continuous, since a typical Brownian path is essen- 
tially a limit concept, the value f (œ) is computable only when f is simple 
(Le., simulatable in the sense of Section D below). 

Fortunately, the problem is not completely hopeless because of the 
approximate continuity property discovered by Stroock and Varadhan 
[1972]; see also Hargé [1993]: 

Let X, be the solution of the SDE in Stratonovitch sense 


WN ka p'(X))dt 

dXi= X 0; (X1)odB, + b'(X:) 
2 

Aga xs 


i i pi th bounded derivatives. 
Where the coefficients o}, b' are Ÿ ”: bounded with 


| e. " 
dXi = So (X1) 0B + bi(X,) dt 
k=] 


Xo = X 
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p= the sol i i i 
(QD) ution of the ordinary differentia) NALS 
“qUation 


dYi= Y YO ar + b'(Y;)dr 
k=1 
Yo =X 


for pe €*((0, 1], R”, (0) = 0. 
Then 


P _ 
| IX; — Y,(@)| > € a lalt) — ph] < J 0 


when 6 — 0. 
By this result, it is possible to hope that for some Sequence of sm 
Ooth 


functions (@,) “dense” enough in (W, HP), 
Le 
EX, = lim — Y(n). 
I= one > (Qn) 
See in this direction Bouleau [(12)]. 


D SIMULATABLE FUNCTIONALS 


We consider first the problem of computing the expectation of, say, a stop- 
ping time T = inf{n 2 0: X, € A} of a Markov chain (X,). If T is 
integrable, then we can apply the general method above, and we can in 
general define a bounded stopping time Te such that |ET, — ET| < €/2, 
and then compute ET: by simulating the chain. But, as we have seen, 
since T is a finite stopping time, such an approximation is not necessary 
and it becomes tractable to compute ET directly by averaging the values 
of T on independent samples. In this section, we study in more detail the 
random variables which can be pointwise simulated directly. We begin 


with discrete probability spaces. 
D.1 Simulatable Functionals on {0,1} 


Let (Xn)nen be the coordinate mappings of {0, 1 
usual, we define the o-fields . Æ n = O(Xm, M < n) and the pro 


bility P = (380 + 18, JON on ZO oas 


IN onto each factor AS 
duct proba- 
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pefini Ai oe existS a se variable F from {0,1 \N to R i | 
imp"? me almost surely finite (4 ,) ee 

ne inst is 0 7-measurable. 7@ n)-Stopping times T 

pe we must often assume that F is b 

sh fact co ounded , 

F = Dino F 1T-n, the law of F is B Chapter 3, Section 


p Sie 
k=l 


an be shown that an (.4,)-stopping time S and an . 5-measur 
spe random variable G with law u can be constructed such that ES is 

A, For x € (0, 1], let {x} be the fractional part of x, and put 
we 2"x}/2". Then the minimal value of ES (finite or hare is 


vi) = An? ao, 1976] 


(Kouth an 


co 


DAT 


k=1 


In general, however, the laws of the simple random variables to be simu- 
lated are not known. This is precisely why the simulation is useful. 


p2 Simulatable Functionals on [0, 1)" 


We now adopt the hypothesis that it is possible to pick from [0, 1] a 
sequence of points independently and according to Lebesgue measure. 
We use this for reasoning about the simulation algorithms, as does, for 


example, L. Devroye [1986]. 

Let (U,),>0 be the coordinate mappings of [0, 1, let Fn = (Umm < 
n), and let P = dx®%. We will consider random variables F defined on 
(0,1}" which are Fy-measurable for some (.F,)-stopping time T that is 
P-almost surely finite. But which regularity must we demand on F to get 


an interesting definition? 
We must, at least, suppose that F is P-Riemann integrable (on [0,1] 
“quipped with the product topology) S0 that, outside a negligible set, the 
Value of F at a point x can be approximated by an approximate knowledge 
of x. However, it is not difficult to define quite explicitly a Borel mapping 
X from (0, 1] into [0, 1]" such that % iS continuous outside a negligible set 


and . 
that the image of the measure dx is ax 
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LD re xk € {0,1} 
k=0 


be the canonical dyadic expansion of x € [0,1], and define 


. = x 
y=x(x) tobe y=(On)n20 With Yn = ` batts 
+ 
k=0 y 


Thus if we suppose only that F is P-Riemann integrable, it js Possip 

return to [0,1] by considering F o x. But, since the digits of y P le to 

on too faraway digits of x, this is in practice useless for the simulatie t 
N of 


functionals of random processes. 
So we must ask more regularity on F than just Riemann-integrapjj;, 
J. 


It seems reasonable to take in consideration the following definition: 


Definition D.2.1 A random variable F from [0, 1] to Rf is Said tob 
simple if there is an (.Y,,)-stopping time T, dx®" -almost surely finite us 
that the sets {T = n} have negligible boundaries, F is .7r-measurable and 
for every n, F is continuous in the interior of {T = n}. | 

As before, we must often suppose that F is also bounded. It follows 
from the definition that F is continuous on a set of measure 1 and is there. 


fore Riemann-integrable if it is bounded. 


D.2.2 Example 
Let X be a random variable with values in R and continuous (or Riemann 


integrable) density f on R. If X is simulated by the rejection method with 
a random variable Y with continuous density g such that f < kg, and Y is 
simulated by Y = @(x), x € [0,1], with @ continuous, then we can write 


T = inf{2n + 1: kU ona) 2 O O(U 2) sf O (Uan)? 


and 
X = Q(Ur-1). 


T is a stopping time with geometric law, the sets {T = n} have negligible 
boundaries, and X is simple in the sense of Definition D.2.1 


More generally, 
N BR) 


Definition D.2.3 Given a probability measure u on (R a 
and a Polish space (E, € ), we say that a measurable mapping G 


y 
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Ny) into (E, €) is simple if there i 

PR. : S a stopping time S for th 
( „tjon Gn) eee n) of RY, u-almost surely finite, ae 
je the poundari ie n} are negligible and G is Y,-mea- 
shat ie and continuous on the interior of {S = n} for every n i 


regularity Properties of Simulatable Functionals 


topping time in the definition is not just almost surely finite but 
ae where finite, the laws of simple random variables depend contin- 
ae on the perturbations of the random number generator and depend 
vnalytically if the Stopping SMe possesses an exponential moment. We 

> first the case of {0,1} and suppose that the generator yields inde- 
pendent pits with law po; + (1 — p)ôo for p € [0,1]: 


ff the $ 


proposition D.3.1 


(i) Let F be simple on {0, 1}" with respect to a finite stopping time T. 
Then the distribution function H,(t) = P,{F < t} under probability 
P, = ( pd, + (1 - p)60)%" is continuous in p, for p € [O, 1]. 

(ii) If, in addition, there is an & > 0 such that 


Eyl + 20)! ] < + 


then H,(t) is analytical in p in an open set of the complex plane 
containing ]} — Q, i + ol. 


Proof (i) With the above notation, we have 


H,(t) = X PF <t} N {T=n} 
n=0 


PHF <t} N {T = n}] is a polynomial (of degree < n + 1) that is positive 
on [0,1]. Therefore p > H p(t) is 1.s.c. and, by the same argument, SO is 
P~>1—H,(t) = P,[F > t], and hence p _> H(t) is continuous. 

(ii) Let M, = IRo + 2P- 1)(2x, — D] be the martingale, which is 


th . 
e density of P, with respect to Pip on + En. 
We have 


H(t) = D EM lrs] iren)] 


n=0 
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Putting q = p — 5, the absolute value of the expectation 


n 
Ey/2[Mnl (rst) 1preny] = En lig + 2q (2x — rs Lira 
k=0 


is bounded by (1 + 2|g|)”"*'Pipl{T = n}l. 

H,(t) is therefore the sum of a series of holomorphic functions on 
|z — 4| < æ} uniformly convergent in every compact set of this do 2 
and the result follows. “a 


These properties extend to .%s-measurable functionals and, in parti 
ular, to simple functionals on [0, 1]: let h be a bounded Borel function x 
[0,1] such that f) A(x) dx = 0 and put C = |||... Suppose that our generator 
yields independent points distributed according to the law (1+Ah(x)).q + 
for À e [-1/C,1/C]. Then we have the following proposition: 


Proposition D.3.2 


(i) Let F be an.75-measurable function from [0, 1] to R, where S js à 
(F,,)-stopping time which is almost surely finite for the probability 
measures Pa = [(1+A-h(x)) dx]®" for À € [-a,a](a < 1/C). Then 
the distribution function H(t) = P,{F < t} is continuous in À on 
[-a, a]. 

(ii) Suppose moreover Eo[(1+aC)*] < +% for some æ > 0. Then H(t) 
is analytical in {À : [A] < a}. 


The proof is similar to the previous one. 


D.4 Simulatable Functionals on a General Probability Space 


In this section we propose a general definition of simulatable functionals. 


La théorie, énumération jamais complète d’un ensemble de règles qui s’amorcent 


l’une l’autre, n’est donc pas le produit de la froide raison, mais de l'instinct qu! 
travaille silencieusement, allumant au foyer intérieur un feu mystérieux dont la 
raison ne fait que recueillir les cendres brûlantes.2 [André Lhote, 1919] 


e each 


The theory, which is the never complete enumeration of a set of rules which initiat re 
t, 


other, is not the product of cool reasoning. It results from the quietly working instinc 
lights the inner hearth a mysterious fire, the burning ashes of which are only gatheres * 
reason. 


N 
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der à Polish space W equipped with its 301 


cons tity measure m. For example, (W, gy Borelian O-field Wang į 
> Z, M) could be the Wiener 


prove canonical space of some stochastic proc 

of ae preceding discussion shows, the hopin: 

„eneral too ee bee tes Simulation. indeed oe ve W is in 

ae ology mu at a fundamental System of abode 

oint is composed of sets described by a finite nu baaa a 

: mbers, that is, by an integer. On the Wiener Space, for ae it 

take simple convergence on rational numbers (cf. Section D.5 below). 
We define a presentation of (W, 7/:m) as 


| i a co | 
structure equipped with a weaker topology: Py of its measurable 


Space 


Definition D.4.1 Let (W, 777m) be a Polish space. A presentation of 
W, g/m) is a mapping from W into R™ such that 


e gis an isormorpnisim of measurable spaces between (W, Zm) and 
(RY, AR ), ) [1.e., U= Qai and JN, m(N) = 0, AN!, u(N!) =0) 
such that @ is one to one and bimeasurable from W\N onto R‘\N']. 

e is continuous. | 


Next we must simulate progressively the coordinate process X = (X n)nz0 
of (RM BRN )\ u) so that if we are given a simple functional G on 
(RY, AOR), u) we can simulate G too. 


. N 

Definition D.4.2 Let X = (Xn)nzo be the coordinates of R”. We 
say that y : [0,1]. > R is a progressive simulation of X = (Xn)nz0 
under y if 


N g N d 
* y is measurable from ([0, 1", 2 (0, 1D") to (RY, AOR )) an 
We(dx®") = D = | mE 
* There exists an increasing sequence of le Se mex ne 
dx® almost surely finite such that Vk > 0 the map “0; 
from [0, 1]" into RÉ" is Fr,-measurable. 


Ww Fm) with values in 


find a presentation p 
the Polish space (E, £) is an s-functional if we ARD, Qe) nto (E, £) 
a, Wim), a simple functional G asa Go Q. 

and a Progressive simulation YW such that / = 
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(W, Fm) 
N 


Ta 


N . N 


[0,1], .A([0,1]§), ax’) 


The preceding definitions will become clearer by their appli 
Cation i 
q 


the context of Wiener space. 


D.S The Case of Wiener Space 

Let (W, HY, m) be the space of standard Brownian motion Vanishin 

where W = @((0, 1], R), ZZ is the Borelian 0-field on W equippeg at 0 

the uniform convergence, and m is the Wiener measure, wath 
We look at several presentations of the Wiener space. 


D.5.1 Dyadic Presentation 
As before, let X, be the coordinate mappings of R“, and put 


Xo 0 p(w) = w(1) 
2k+1 
=} k=0,1,...,27-1: n>0 


Dn+1 


Xank O p(w) = w( 


This defines a continuous mapping © from W to RN (equipped with 
the product topology). The coordinates (X,,) constitute a Gaussian pro 
cess under the probability measure (xm and the law of the vector 
(Xo,...,Xn_1) is a permutation of the law of (w(1/2"),..., w/e 
w(1)). @ is clearly an isomorphism of measurable spaces. {= 
It is easy here to give a progressive simulation y of the process ing 
(Xn) on the space ([0,1]",.21[0, 1]", dx"). This consists of ani a 
Xqn,Xone1,...,Xan1_, given Xo,...,Xon_,, which can be done ee 
Lévy’s celebrated recursive definition of Brownian motion of (cf. C 


3, Section C). 
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(RN, ARN 
a ee ee 


(0,1; BO,1M, d\) 


We shall now see that there are simple functionals G, which define inter 


sting -functionals F=G09. 


p.52 Example 
ror € > 0, WE define 


Tes mt p : sup 


(Rs 
i<k<2" 2” an 7 
and denote by Lon(w, t) the piecewise linear dyadic approximation of order 
nofwe W. So, w > Lan(w,t) is a mapping with values in GE o((0, 1], RÂ) 
equipped with its Borel sets. This is a Polish space. 

Let us put Fe = Lre- 

Then F, is an s-functional: it factorizes as Fe = Ge © @, and we have 
the following lemma: | 


Lemma D.5.3 G is a simple functional (Definition D.2.3). 


Proof It is easily seen that we must prove the properties of Definition 
D.23 for Fe and Te when W is equipped with the topology of ea 
eee on dyadic points and with the o-fields Hn = (Ww k/2"), K = 

es 9 2"). 


(a) Clearly Te is an (W an )-stopping time m-almost surely finite, and 
Fe 1S KH ` 
T,-measurable. 1/2"). 


(b) Let us put Aon(W) = SUP, <K<2" |o(k/2") g o((k = 
© remark that the set 


Ac ={w: Ar.(w) < c} 
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| 


. is open for the topology of simple conve 
if wọ € Ae and if the T e(Wo) points 


k 
CCE). k= Le, Tewo) 


are sufficiently close to the respective points 


k 
(w| z5) k= La), 


then Te(w) = Te(wo) and hence Lr, (w) is closed to Lr, 

Fe is continuous on the interior of {Te = 2” } and it ca 

boundary of {Te = 2”} is negligible. D 
In the same way the functionals 


rgence on dyadic Points l 
. Ndegg 


4 
| 


(Wo). It follows tha 
n be proved that the 


F? =Ly1 where Tọ “int 2 ` 
k 


Fely where J? = ZE i ` 
i k 


e [l—e,1+ al 
D.5.4 Example 


We take again the example of Section C: 


are s-functionals. 


] , 
Fow)= | widw?. 
0 


Notice the following simply proved facts. 


(i) Elf. widw2] ws = (ay, bi), w = (az, ba)] = (ay + 42)/2(b2 = D 
(ii) EL; whdw? — (ay + a2)/2)X(b2 — b1))?|Ws = (ai, bi), We = (a2 I= 
((t ~'s)/12)((az ~ a1)? + (bz — b,?) + (t = 5)°/6. 


LE FUNCTIONALS 


B 
sont” 305 
pul hui 
Fa = EF] = 2 * Marry 
k 2 Wkeny/on — Wen). 
of course Fn > F almost surely and in LP, pe}, eof. 


j) EIF-Fn) lÆ] = 


ë 7 2"_y 
12 1/2 Z hw ((k+1)/2")—w(k/2") 2441/2", 


Then putting V2 for this last expression, we define 


Se = inf {n>0: V2 <2), 


py the same argument as in the precedin 


8 example, Fs, is an s-functional. 
We obviously have E[(F — Fs.Ÿ|.7 


2e] S €* and, taking the expectation, 
ILF ma Fs. |l2 <E. 

D.5.5 Presentation with General Partitions 

Rather than using the dyadic rationals, we can instead use all the rational 


numbers. Let r : N — QN [0, 1] be an enumeration of the rational numbers 
p/q € [0,1] and put 


Xn © Ow) = w(r(n)) 


With this presentation, it is possible to define s-functionals, which depend 
on the points of a partition of [0, 1] whose “deepth” is random, as before, 
but whose “thinness” can now vary from place to place in [0, 1] as a 
function of the values at points already picked. This family includes the 
discretization for diffusion in space studied by O. Faure [1990]. Of course, 
many other presentations of Wiener space can be constructed, each with 
its own family of s-functionals. 


DS6 Remark | 
We consider the program that yields a picture like a au 
hapter 3 showing a 2-dimensional Brownian path (whic a ne 
Y recursive simulation), we get a map from the Wiener m R 
‘Pace of piecewise affine lines of the plane. As it is now clear, 
“Tunctional. 


b 


D 
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In this section, we consider the algebraic features of the stochact: 
culus. Our account is purely a symbolic survey, we do not A ik Cal. 
assumptions, which must be stated for these formulas cut 
are Still partly in the domain of research. 


E.1 Ito-Stratonovich Transformation 
Definition E.1.1 Let X and Y be continuous semi-martingales Th 
$ € 


Stratonovich stochastic integral is defined by 


t 


t 
[ x. o ar. | X;dY, + (X,Y), 
0 0 | 


The following chain rule follows immediately from Ito’s formula. 


Proposition E.1.2 Let X',...,X" be continuous semi-martingales and 


let F be a C?-function from R” to R. Then 
n t 
FOR) RSA 0) >» FO, A 6 dx. 
i=l 0 


E.2 Hu-Meyer Formulas 
We consider a standard (F;)-P-Brownian motion B, = (B},... ,BÎ) with 
values in R°. Let E be the space R, x {1,...,d} equipped with the 
product measure u of the Lebesgue measure with the counting measure 
on ¢1,..i,d}. Lets, bein Le" n°) the space of symmetric functions 
from E” to R which are in L7(E”, 12”). 

We define /,( fn) by 


inant > Í E 
pA Oct cty 


(li, in) € {1,...,d)} 
(x, in)) dB... dBin, 


The set of random variables of the form /,( fn) is a closed subspace 


jc EXPANSIONS OF SOLUTIONS TO spp, 

at In, called nth W; ni 
say, I}, Called nt iener cha 

F p (P). OS and we have the following 

girecl ai 


L*(P) = @ Il, 


n=0 


i pouleau and [1991, Proposition IL2.1.2)). Thus every random 
ce le F in Lf defines a sequence f _ (fn) such that F = 
Sol) nl Fr 
We set 
— 1 
I(f) = De 


n=0 
The isometry of stochastic integrals then yields 


co 


1 | 
O = X = AE 


n=0 


The norm of the left-hand side is taken in L?(P), and that of the right-hand 
side in L?,(E", 12"). i 


A trace operator Tr is defined on a suitable subspace of Lon ee) 


consisting of sufficiently smooth functions, with values in L aE, 
u®"-2), given by 


TER =O 
and, for n > 2 
Tr), i), or CE? in-2)) = > | Talin i1), ee 
(insin—1)€ (1, ... 4}? Ry 


(tn-2, In-2)(s, in-i (S in) ds, 


Where 8; is the Kronecker symbol: 6;,; = 1 if i = j, di,j = 0 otherwise. 
€ Operator Tr can be extended to sequences f = (fn) by 


(Tr(f))n E Tr(f n+2) n 2:0. 
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Let us return now to the definition of J,( fn) and Suppose that the f 
tions f, are sufficiently smooth for the multiple Ito integral to be ites ee 
as successive simple integrals of processes which are semi-martin i 
We can then replace the Ito integrals by Stratonovich integrals, and re 


Sn(fn) =n! D i; Jnl (tiie 


Gels Is ee 
(tn, in)) © dBi! DA dBi. 
For a sequence f = (f,) of such functions, we set 
(E.2.1) sŒ)=1(e4/277f), 

In this formula the operator eT" = Yeo A/kY 1/27 r Operates 
on sequences. Taking for f the sequence where only f, differs from Zero, 
gives 
E.2.2 - a A E 
(E.2.2) Sn(fn) = 2 (n = 2K)!k! 2k n-2k(Tr fn). 


O<k<n/2 


Similarly applying (E.2.1) to the sequence e{l/21"f gives 


(E.2.3) If) = Ste 4/27) 
and 
| n! ii p 
(E.2.4) If n) = 2 a2 | -;) Sn-2k(T FT n). 
O<ksn/2 


Formulas (E.2.1) to (E.2.4) are called Hu-Meyer formulas [Hu and Meyer, 
1988]. We shall explain these formulas in the simple case when d = | and 


fn = h” for he L?(R,): 


Fais ’ pth) ae hti) Ata). 


If as before we set 
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ce the Stratonovich integral Satisfie 


st ! 
i - 1.2), we have he usual chain rule (Propo- 


S(f) = exp i" j 
E25 ) 0 “Ja 
and 
626 ‘ Sn(fn) = (f h(s) dB, ) 


Moreover: if we consider the martingale 
M: = exp f h(s) dB; — f h(s} as]. 
then the Ito formula shows that it satisfies the equation 
t 
M;=1 + | Msh(s) dB, . 
0 


This is also satisfied by the martingale N, defined by 


N,=1+ | A(t,)+++A(tn) dB, «++ dBi,- 
O<ti<--<tfn<t 


nzi 


It follows [Bouleau and Hirsch, 1991, Lemma I.2.1.2.1] that M; = Ny. 


hus we have 


(E.2.7) I(f) = exp| f h(s) dB; — a his) is) 
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Dai («free fra) 
= 1\* . 
-5 ¥ (-) mo (Jr) (Jea 


m=0 0<k<m/2 


By (E.2.7) the coefficient of z”/m! is equal to Im(fm). Moreove 
(E.2.6) pu 


m—2k 
ee | oies | | h aB) 


m—2k k 
Saar is | Í h dB) | | h? as) 


and 


So we have 


DT : 
Lata | -;) (m — 2k)tk! Sm-2 (TT fm), 
O<k<m/2 


which is (E.2.4), and the other formulas (E.2.3), (E.2.1), and (E22) 
follow. 


E.3 Isobe—Sato Formula 


Let B; = (Bi, Pr ,B4) be a d-dimensional standard Brownian motion with 
respect to the filtration (.7,). We consider a stochastic differential equation 
in Stratonovich form 


d 
dxi) = X oX) o dB? + oX) dt 
Q=] 


X (x) =X, 


HEAR 


where X;(x) = (X}(x),...,X?"(x)) takes values in R”, and the coefficients 


| ; ee , a to 
Oy, Op i= 1,...,n are C”, Equation (E.3.1) is a rewriting of the 
equation 


| 
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J 


„MBO! = 
{ d 
i dX,(x) = > Sa X (x) dB} + G(X dt 
a=] 
£3 2) Xo(x) = x 


à ai = OE) +3 Lou Zlato (xo (x. 
with re extension of Ito's theorem [Ikeda and Watanabe, 1981, Chapter 
Theorem 3.1] equation (E.3.2) can be shown to have a unique strong 
ee n (cf. Ikeda and Watanabe [1981, Chapter IV, Definition 1.6] or 
S° ers and Williams [1987, Chapter V, Section 2], or Revuz and Yor 
se Chapter IX]). If, in addition, the coefficients of equation (E.3.1) 
[1 pally Lipschitz, the solution ([Ikeda and Watanabe, 1981, Chapter 


Jo 
A Section 6] defines a strong Markov process with generator A given by 


1 d? 
AfC = 5 > aij(x) 55 (x) + > bi(x) To, 
ij i i 


where 
a = >, Fala) and b,(x)=Gi(x) and fe ZAR’). 


Let Ag, be the field of first-order differential operators defined by 


n l d 
Afo So az1 
i=l 


and 


n | of 
Ao f(x) = >, 9) a7 
i=l 


Then we have the following lemma. 


Lemma E.3.3 Vf € © A(R") 


d 
Af- LS AalAaf) + Aof 
(1 


Y 
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Proof By (E.3.2) we have 
d . 
o=l 


and the lemma follows. Q 
We consider a smooth function h : R” — R and write the Wiener Chaos 
expansion of the random variable h(X7(x)), T € R., as 
AXT) = Ÿ° 


m=0 (OL, …., Om)E {1, ….,d}" Lo 


Fna, 1 h; (Sis O1), rc) (Sin, On) AB: > Bin, 


The Isobe-Sato formula yields an expression of the functions Fm in 
oe of the semi-group (P;) of the Markov process X and the Operators 
oe 


Fin, F, h; (51, œ), teeny (Sm; Om)) 
(E.3.4) = Ps Ag, P 55-51 Ady * * Pos Al Pose): 


As before, we explain this formula in the case d = n = 1. We follow 


the method sketched in Léandre and Meyer [1989]. 
Consider first any random variable f € L? with chaos expansion 


=>] Pm(S1,--+55m) ABs, +++ dB. 


m—-0 STS SS py 


If g, is continuous with compact Support on R,, we have 


(23.5) efs] 10 = | fig (dt 
) 0 


and, similarly, 
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gi" 
ep]. entr Ne Im) dB, nr dB;,] 
3.6) = ie fm(ti, sey tmSm(t1, HE >tm) dt, se -dtm 
7 " 


| continuous with compact support in the sector Am = {(tis--<>tm) 
ye `.. < tm}. Thus the map 
îs 1 


pi efef Balises ta aby, dB, 
<< Tm 


sa bounded measure on A,,, absolutely continuous with respect to 
dtm and with density fm(t),...,tm). It is convenient to write this 
in the symbolic differential form : 


define 
dti 
property 


3.7) ELf-4By, `- dB] = fmi» ---» tm) dti +++ dtm: 


Let us return now to our diffusion process (d = n = 1) and compute 
using (E.3.7) the chaos expansion of h(X7(x)). 
Beginning with the first chaos (m = 1), we have 


F(x, T, h, t)df = Elh(Xr(x)).aBy,] 
= ELE(AXr(x))| Fi, dB] 
= EfPr-1hXn @D)-dBnl. 


Moreover, Ito’s formula gives 
ti | 
Pr- h(X1,(x)) =Pr-n h) + f (Pr-n AY (Xs(x)) OX s(x) dB: 


: [ (Pr hy XX) ds 
0 


gå | Pra hy” Xs (Xs) ds. 
: 0 


X . We get 
We use this relation to compute [Pr W(X QO)-dBr] 8 


vY 
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E[Pr-1h@x).dB;,]1=0 


E | Í ” (Prin h Ka + GA (Dds.dB | =i) 
0 


Thus only 


E[Pr_,,h(X,, (x)).dBr,] = E | | (Pra hy XNA) 48.48, 
0 


= E[Pr-1h) (Xr, OX @)) dt] 
= P,6Pr-nh) dt. 


remains. 
And finally, 


FACE, Th, tı) = P,,AP7-1,h(). 


For general m, the computation is similar. We evaluate E[h(Xr(x)) dB, 
«s dBi J for ttm <T using the facts that dB,,, , is F;,,-Measur- 


able, dB,,, , is F;,_,-measurable, and so on. 


E.4 Expansion of the Solution of a SDE as a Series of Iterated 
Stochastic Integrals 


For notational convenience we will put 
0 _ 
B=; 


For J =(j1,...,jm) € {0,...,d}”, we define the integrals 


s= [| odB}'... o d h 
O<t << ty <t | 
and | 


O<ti << ty <t 4 | 


than nc 
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d 
£.4.1) dX, = On(X a 
(E 2 al) o dB; Nas R’, 


suppose 
where we ppose now that the coefficients can b 
series in a ball with center x = (x'). Our wie © expanded as power 
i 


e è N A 
, as à series of iterated stochastic ; to represent the solution 


nte 
auxiliary equation grals. For this we consider the 


d t 
= 
(E.4.2) ea >] Gu(XS) o dB%+e[ o (X®)d 
. a=] “0 0 by cee 
Let us write the Taylor expansions at the point x 


Gal) = > ay x) MED E 


VEN” 


where, as usual, yY = (y!)Y!---(y")” for v = (Vi,..., Vn) € N” and y = 
(y1,...,Yn) € R”. Here the coefficients af take values in R”. 

Let us suppose that the process X; can be expanded as a power series 
in € with coefficients that are processes Y} with values in R”: | 


(E.4.3) xt= Sey} with Yp=x. 


120 


In differential form, the equation (E.4.2) becomes 


V 
Year! A SSe (xe) o dB} 


V 
(E.4.4) +e? za( zer) dt. 
ve N” 


i2\ 


i fg izd 2... WE obtain 
Now we identify the coefficients Of € ; ; 
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i=0: aya 


a ae 
i=1 dy! = X oal) o dB? 
o=1 


d 
i=2 d? DD o dB* + G(x) dt 


o=1 |v|=1 
d 
i=3 dř De (Zao + > a¥(¥;)” | o dB? 
o=1 \|v|=1 |v|=2 
+ bite) dt, 
|v|=1 
and so on. 
More precisely, if bj,...,b are points in IR", we consider the poly- 


nomial P(z) = biz + --- + bz” for z e R. Let C$(bi,..., bm) be the 
coefficient of z* in the formal series 


Salt + P(z)) = X, air +-+- + bz)”. 


VEN! 


Then our identification can be written 


d 
dY! = $ ou) 0 dB? 
œ=] 


d 
dY? = SCT) © dB? + Go(x) dt 
(E.4.5) =l 


d À 
AVP) CE tor) ogre 
a=1 


CT eeu YP ae 


We note that the equation for Y?” depends only on Y!,...,Y ae Thus 
the system of equations which gives Y” can be solved by successive 
stochastic quadratures, and so Y” appears as a finite sum of iterate 
stochastic integrals. 


AÀ 
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OL 

MB 

5) 


| 1 
. a Q W t 


ch amounts to Count stochastic inte 
ie we have the following proposi 


grals for 1 and dt- 
( 


integrals for 2). 
tion: 


proposition E.4.6 Let (y™)i 


be the j-th Component of the process Y”. 
We have 


ry = >. Pi (a2)! 


MES 


the PÍ, are universal polynomials with rational coefficients in the 
where 8 
coefficients ay of the Ga. 


Proof We shall only sketch the argument. We call IJ || the order of 
ro 

1 ST. ; à . , la 
g a n applications of the chain rule Sere - e 
oo integrals) we see are | i 1 : rae re 

hastic integrals of order ||/1 Fe: 
rs a that, by the identification (E.4.5), Y; is a finite 
argument on | : 
a of stochastic integrals of s A > cy! which represents X, h 
in the series A; = 

(In other words, 1n 

degree of € is the order of Y;). 


es 
Corollary E.4.7 For every s > 0, the process 
LED peo 
l m/2ym) and (Ysp 
(V/s Yo 5227352 f 


have the same law. 


j of the Brownian 
ce of the scaling property 
Proof This is a consequen 


motion 


(Bso ~ (\/sBy)r20 


and the previous representation. 
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E.4.8 Example | 
Let us consider the equation 


dX, = Xr. © dB,, Xo = X, 


which, in Ito’s form, gives 


Bie Biase? 5X dl. 


The Stratonovich form, which follows the ordinary rules for change of 
variable, immediately gives the solution 
X= xe”. 


The formal expansion 


x= | o dB,,-:: © dBi) 
O<t) <- <tn <t 


converges for every t. 


E.4.9 Example 
Consider the equation 


dx, = X1 O dB, 
{ Xo = 1. q >2 
We obtain 


| | I/(q-1) 
Xı = _—— 
l- (g - zz) ‘ 


which can be expanded as a power series of B, by composing the expan 
sions of 1/1 — x and (1 +.x)*. For q = 2, the expansion is 
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1+B,+...4 pk 
FT Be TOP 
; for fixed t, with positi = 
iverges» pes ve probabilit k 
ee converges for t strictly smaller than the pee Ge Ka 
S ime T = inf {t = 


gil = 2 


APPLICATION OF THE SHIFT MET 


HOD TO 
WIENER INTEGRALS AND TO SOLUT MULTIPLE 


IONS OF SDEs 


the shift method was given in a general setting in Chapter 3, Section 
AA. It is based on Birkhoff ‘S pointwise ergodic theorem. This method 
is especially interesting because its implementation enables us to store 
intermediate computations. We investigated the rate of convergence of this 
method in Chapter 3, Section A.5. Not every function in L? satisfies a law 
of the iterated logarithm, and in Chapter 3, Section A.5.3 we introduced 
the subset of L? called the Gordin class where an L.LL. holds. 

Our purpose here is to investigate the application of this method to 
computing the expectations of functions of multiple Wiener integrals or 
solutions of stochastic differential equations of the Ito type, and to examine 
whether or not these functionals belong to the Gordin class. 

We use the notation of Chapter 3, Section A.5.3: 


(E, €, u) is a probability space and (Q, ~, P) = (EZ, FS, u95). 


The coordinate functions from Q to E are denoted by Xn. We define 
the ergodic automorphism T on (Q,.4, P) by 


| Xot Api Yn € Z. 
| We call t the shift to the right. We put 


| <ne Z 
| F” = GX ms Xm» > Xn) for msn 
F" = (Xp, ksn) 

| Fi? = o(Xx,k 2m) 

FT = GX. k € D). 


oo b 
Let T be the operator on LF o> p) defined by 


Tf=EUlF411°* 


> i 
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We denote by the Gordin class (cf. Chapter 3, Section A.6), 

To apply the results of Chapter 3, Section A.6, we must write the Wi 
space as a product space on which the shift operates. We can do fa 
several ways that give different results. We first study a factorization | 
which the shift appears as a scaling transform on the Wiener space. 


F1 First Factorization of the Wiener Space | 
Consider the space W = {f € @([0, 1], R“),f() = 0} equipped with its 
Borel o-field .⁄ and with the Wiener measure m. 


On the space (Q,. 46, P) = [We ny Mn), Where Wim, 


are copies of (W,.#,m), we define a Brownian motion (B;};er, as follows 
Let X, be the coordinate map from Q into W, and for f e]1 Joe 4 /X 


and k e Z we put 


B; = Sur D72 DA+D/2 


= a 


n=k+1 


The process thus defined is Gaussian centered with independent incre- 
ments, and tends to zero as f goes to zero. Its covariance is easily computed 
to be s a t times the identity matrix ; it is therefore a standard R?-valued 


Brownian motion. 
The transform T defined on Q by 


XnoT=Xn] ne Z 


is a scaling: 


B,ot= B>,. 


V2 
The results of Chapter 3, Section A.6 apply with 
Po = O(Bs,58 1) 


Fi =0(B,5<1) 
Fy = O(Bs - By isis € J1/2**', 1]). 
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p Fun ns of Lipschitz SDFs 
we consider maps 


o : R” x R, — R™d 
and b:R"% R 
| +> R” 
isfy the Lipschitz hypoth : 
ye m 0 such that Vs e (0, 1]Vx, 


lo(x, s)| + |b, s)| < C(1 + |x|) 


lo, 5) — Oy, s)| + |b(x, s) — b(y, s)| < Clx - y| 


where |.| is one of the equivalent norms on the Euċli 
Let X% be the solution of the SDE uclidean spaces. 


t t 
X =x | 00,9 4B, +| b(X*,s)ds xe R”. 
0 


Proposition F.2.1 Let f = h(X*) for t < 1 with A : R” — R Holderian 
with exponent A € ]0, 1]. Then f € G. 


Proof Let A be the Hölder constant of h: 


AG — h(y)| < Alx - yl, 


and let (P,),4 be the semi-group of the Markov diffusion process associ- 
ated with the solution X7. By classical estimates [Kunita, 1982, Chapter 


. ugy xy À 
\P,A(x) — Puh) = A(X y) — ERX?) <AE|X* - Xi 


< K|x - yl" Yu € [0,1], 
; jons M i and on the 
for some constant K depending On the dimensions M and d anc 
Constants C and A. | 
f ® is Hélder with exponent A, We have 
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Var [o(X*)] = E|@(X%) - EX) < Elo) - pE(X*))2 
< C,E|X* — EX*|?*sC.(1 + xls 


(see Kunita [1982, Theorem 2.1)). 
Now, we put Ø, = o(Bs,s < t) and remark that 


T'f = ELAX | Pr] ov" = Prop hX jn) o T". 


Hence by the preceding estimates we get 


IITE — Ef)||? = Var [T"F] = Var (Papen (KT pan)] 
1 


<c(1+ |x|?) Fah 


and the series X. ||T7"(f — Ef)|| converges geometrically, which implies 
the result by Lemma A.6.4 of Chapter 3. 


Proposition F.2.2 Let (ds, dx) be a bounded measure on [0, 1]x R” 
such that 


| (1+ [x|*)Iul(ds, dx) < +00 
[0,1]x R” 


with A e]0, 1], and let g be a Hôlderian function of exponent À. Then the 
functional 


J -| 8(X5)u(ds, dx) 
[0,1] XR” 


belongs to :Z. 


Proof We have 


\ 


nm EEO 
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1/2" 
T"f -| | 9(X%)u(ds, dx) o t" 
0 R” 


1 
+ | | Puma im) (ds, dx) 07 
1/27) Rm 
and hence 


1/2” 
ons | f ND- acpi 


1 
+ | (Var Pam g X172)" dyl. 


1/27) RM 


Therefore, by the estimates used in the preceding proof, 


1/2" 
Ing - EN) f | AG + Hu ds, a) 


1 

1 

Mn. 
[8e + RD Sak |u| (ds, dx). 


By our hypotheses the second term is bounded by C(1/ 2”/2) For the first 
term, we remark that | 


DAT ONE 


n=0 


is bounded on s € [0, 1], from which it follows that 


DIT" — EP < +. 
The result follows by Lemma A.6.4 of Chapter 3. L 


F, , 
3 Functions of Multiple Wiener Integrals 


ends 
y ee is the same as in Sections F.1 and F2. 
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r-[ Mtr, -- -s tm) 4B dB + din, 
m 
O<t1 <i <tm <1 


where i; e {1,2,...,d} for k=1,....m with 


| h?(t1,...,1m)dtidt*""dtn < +0. 
O<t]<:<tm<l 


It is easily seen that 


oi ty bin me dt . 
np _ ren #4: tm il JRİ2 ... JRim 
T'E = I. Luce à MP ho dB} dB? ...dBin 


Therefore F belongs to the Gordin class if and only if 


N 4 2 
sup | rx) dtd ty ---dtm < +o. 
N O0<t) <--<tm<l 6 2 me Ze Za 


F.3.1 Example 
Take m = 1, h(x) = 1/x% and a < 3. It is easily seen that 


l] _ 1 
p=] Gabe $ vas: 


F.3.2 Example 
Let us take 
l 


WA e 
(x) NEPE 


1 
ith =, 
wi B>; 


Then 
1/2 1 
F = —————— dB 
o vi- log 1 | 


is in the Gordin class if B > 1, but F is not in & ifBe]!, 1] although ‘à 
that case h e L?2[0, 1]. í 


semen 


asemassaan su 
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h(x) = | Sin (log, x) 
X log € ? 


then the functional F = le A(t) dB, gives 


g such that an example of a functional in 


D IT'FI| = 400 
4 


and 
1/2 
| |h(t)|aB, i 
0 
ig not in &. | 


F3.4 Example | 
We consider a real Brownian motion (d = 1), and a square integrable | 
function F with the following Wiener chaos expansion: | 


F=Fot+ > Fm | 


i m 
-Fo+ > | hm (tt, .. 29 tm) dB: dBi, ++: dB. 
m=1 * 0 


<tj<<tm<1 


ay” oo . . 
Suppose that m(ty, »-+5tm)| S$ dmt +++ la y with EVI = lyse ght 
Since the chaos are invariant by T, we get 


50 that, if L Fe% the series 
that, if all a” are equal to @ < 3: E E 7 whenever 
i q 2 


N 2 8e EN ê | 
STE - Fo) => ST" Fn 
n=0 m=1 n=0 | 
co 2 | 

S$ m i | 

m=] [1 B gotts-sott-m/2p] | nen 25 op : 

i=| kel i 


wW 
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am 


m=1 


converges. 


F4 Other Factorizations of Wiener Space 


Let (Xn(¢))n20 be an orthonormal basis in L7[0, 1], and let On(t) ~ 
i Xn(s) ds. Let (gn)nz0 be a sequence of independent standard Gaussian 
variables constructed as the coordinates of (Q, , P) = (R BR) 


N(O, 1)®%). The series 
(F.4.1) SY gn9n 
n 


x 


converges in %([0, 1]) a.s. and in Z ((Q, Æ, P), €([0,1])), p € [1, of, 
and its sum is a Brownian motion under P. 

Indeed, if we put ¥,,(@) = i Xn(s) dB, on the Wiener space and Z, - 
O(X4, k < n), then, writing B for the identity map from @ [0, 1] into itself, 
we obtain 


(F.4.2) EBA I=} Go, 


k=0 


which can be seen by applying a continuous linear functional u on & (0, 1] 
to both sides of (F.4.2) and remarking that (U,Y%o,...,%n) is a Gaussian 
array. By the convergence properties of vector martingales [Neveu, 1972] 
we therefore have 


(F.4.3) B= Se 
k=0 


a.s. and in L’. Since the family of partial sums of the series (F.4.1) has 
the same law as the sums of (F.4.2), the assertion is proved. U 
Representing the Brownian motion as 


B= ET 


allows us to define the shift, and the associated Gordin class clearly 
depends on which basis (4,) we chose. 
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f Haar functions is j 
e case O Particularly ; 
- 1pi/2.t and Y interesting. Let us put % = 
10/2 
Xm,k(t) = 2/2 (2m; ~k) 
t 
Pm,k(t) = f Xm,k(s) ds 
orte R, m E€ Z, andke N. 
á functions (Xm,k)meZ,ken are an orthonormal basis for LERSE eai 
m standard independent Gaussian variables, the Brownian motion can be 


B(o, t) = ` (È DEC) ) 


m=— oco k=0 


md the scaling studied in Sections F.1, F.2, and F.3 is the mapping that 
transforms the sequence 


(2m,k(@))m,k 


into the sequence 
(2m-1,k(O))m,k- 


The space generated by the functions (Ym,k)m>0,0<k<2" is the subspace of 
L?[0, 1] orthogonal to the constant functions, and the process 
œo 2M] 

(F.4.4) Z: = >. D Om,k(O)8m,k 

m=0 k=0 
is a standard Brownian bridge vanishing at zero and one. The represen- 
lation (F.4.4) is unique and converges in Gol, 11,= {f € (0, 11,f(0) = 
f(1) = 0}. The functions Qm, constitute a Schauder basis for this space. 
ffe Co[0,1] with 

oo 2-1] 


(F4,5) fW = ` 5, Pm klam), 


m=0 k=0 


then 


, k+l m/2 
wo A (k\- (45*) |e" 
minfa Leger) C3) 2 
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x 
x; 
x 
t 


om map 


Figure 5.1 The flow of a Lipschitzian stochastic differential equation is the rand 
(t,x) — X7(@) where X7 is the solution of 


7 t 
Xp =x+ | o(X;)dB,;+ | b(X\)ds. 
0 0 
' : and 
This map is regular with respect to x (smooth if the coefficients o and b are smooth) y 
irregular (as a diffusion path) with respect to ¢. The picture shows the graph of the m’ 
(t,x) — X; (œ) with hidden parts. 


Y 


& 


cat 


j of Hölderian f i 
h space erian functions wi 
panac ; Ns with expon TONE 
| T to Gol, 1], can be interpreted in terms ee 10, 1[, which 
| be aces (2"%am,k)mk [Ciesielki, 19601. and co on the 
seq ching a continuous function b . 
Appr oa 9y à partial sum (F.4.5) is in ti 
savenient, and it we change the notation by putting azm, = am ne se 
=0,...2" — 1 the Simple shift on the a, that is, the transform 
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a T E OP ACTE sets aie) 


(which does not correspond to a scaling) is quite thrifty in random draw- 
ings: | 


Notes 


CHAPTER 1 


The preliminary material contained in Chapter 1 is covered by a numbe 
of works, among which we mention Breiman [1968], Feller [1966) 


Shiryayev [1984], Bouleau [1988], Dacunha-Castelle and Dufio [1983] 
The main results of Chapters 2, 3, and 5 are presented in a Simplifieg 


form in the article by Bouleau [1993]. 


CHAPTER 2 


It seemed to us that it would be of interest to gather in a single chapter 
all the material relevant to Monte Carlo and quasi-Monte Carlo methods; 
this was partly because both methods rely to a great-extent on the same 
mathematical concepts, and partly because they are often in practice inter- 
changeable. A discussion of their relative advantages is given in Sec- 
tion C.5. | 

It seems likely that future work will produce pseudo-random sequences 
for which certain asymptotic properties have been proved in the same way 
as for uniformly distributed sequences, with properties other than uni- 
form distribution (asymptotic independence, etc.). From the point of view 
of the numerical analyst, it will be possible to choose from a variety of 
approaches according to various criteria: precision, speed of calculation, 
provability of results, generality or Specificity of method, and so on. | 

Section A.J. For more detailed investigation, a proof of Prokhorov’ 
theorem (A.1.6), and applications of narrow convergence to random pre 
cesses, see Billingsley [1968] and Shiryayev [1984, Chapter 3]. 

Section A.2. An explanation of u-Riemann integrable functions © 

given in Schwartz [1970] and Bourbaki [1965 (2), Chapter IV, §121. 
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em concern 


; ohod [19 
Section A.4. See Breiman [1968] fo [1965]. 


r a treatme OIE 
tons- nt of distribution func- 


Kuipers and Niederreiter [1974]: see 
[1976] (in particular, for other applications of Weyl’s lemma) 

Section B.I. For a discussion of Menten cee 
Dellacherte [1978], Knuth [1981], and Martin-Lôf [1966], The te = 
nology “Borel sequence” for a sequence that is A eee 


ad Lid. wi j 
on the cube is introduced here with reference to t with uniform law 


i | he works of Emil 
[1958]. There is a large literature on simulation E EE oaiae 
[1986], Bouleau [1986], Rubinstein [1981, 1986], Ripley [1987] Knuth 


[1981]—and we refer to this for techniques for reducing variance, which 
as mentioned in the introduction, we do not deal with here. There have 
been some recent improvements in these techniques, in particular for prob- 
lems of particle transport in physics, where large deviation methods allow 
optimal reductions in variance for certain types of event; see Buckley 
[1990], Hahn and Jeruchim [1987], Cottrell, Fort, and Malgouyres [1983], 
Yarmolik and Demidenko [1988], Dagpunar [1987]. 


Sections B2, B3, and B4. The theorem of Glivenko-Cantelli is given, 
for instance, in Dacunha-Castelle and Duflo [1982] and Bouleau [1986]. 
An accessible treatment of the generalizations to functions of the central 
limit theorem, usually known collectively as the principle of invariance, 
is given in Breiman [1968]; see also the survey of Gaenssler and Stute 
[1979]. The intuitive meaning of the most important historical and current 
works on the speed of convergence of empirical quantities is explained in 
Deheuvels [1990]. The theorems of Donsker and Chung-Kiefer, quoted in 
Sections B3 and B4, which are concerned with the asymptotic properties 
of empirical discrepancies for i.i.d. random variables with uniform law 
on the cube [0, 1]', have been addressed in a number of recent works, 
which extend the results to independent identically distributed Banach 
space valued variables, replacing the class of indicator random variables 
on which the closure is taken with more general classes (see Dudley [1984] 
and Pollard [1984]). For recent results on the properties of ue 
tributions we mention Shorack and Wellner [1986] and Deheuve ae 
Mason [(1) (2) (3) (4)]. Theorem B.4.2 for the case s = | ne es 

9]; it has been generalized to other forms pes e mri 
(D (2); Shi, 1990]. The case for dimension $ was p ane à nee 
[1961]; see also Cassels [1951] and Zaremba ee 
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and related problems see Csürgô and Révész [1981] and Gaensgsle, _ 
Stute [19791]. complete presentations of algorithms 


For more 
Benvéniste et al. [1987] and Dufio [99 


Our account follows Bouleau, Lamberton, and Lapeyre [1986], Where 
the practical importance of stopping conditions 1s emphasized. For more 
general assumptions than those of Theorem B.5.1, see Métivier [1983] 
Benvéniste et al. [1987], and Métivier and Priouret [1987]. The go,’ 
ping condition B54 is taken from Dvoretzky [1956]. For the speed of 
algorithms of Robbins-Monro type see also Komlos and Révész [197% 


Benvéniste et al. [1987], and Lapeyre [1988]. E 
Algorithms for stochastic approximation have applications in ve 
diverse areas (recursive identification, signal processing, adaptive control) 


see Benvéniste et al. [1987], as well as for shape recognition and for max;. 
mizing a function, as in Kiefer and Wolfowitz [1952], which has been 
followed by a number of works in the spirit of “simulated annealing algo- 
rithms.” 

There is continuing research aimed at using arithmetic sequences 
in these algorithms; see Niederreiter [1983, 1988], Niederreiter and 
McCurley [1979], and Lapeyre, Pagès, and Sab [1990], or in “self-organ- 
ising” algorithms, see Bouton et al. [1992]. | 

Sections C.l and C2. For a survey of low-discrepancy sequences as 
of 1978, see Niederreiter [1978]. 

We have given a simplified form of the Koksma—Hlawka inequality 
[Koksma, 1942; Hlawka, 1961]; the idea for our presentation was sug- 
gested to us by G. Pagès. A more elaborate version, which takes into 
account variation in the sense of Hardy and Krause, is given in Kuipers 
and Niederreiter [1974], on which we base Section C2. 

Section C.3. This section is standard material from ergodic theory. For 
the proof of Remark C.3.5, see Cornell et al. [1982]. 

Section C4. Lapeyre-Pagès sequences are studied in Lapeyre and 
Pages [1989]. Faure sequences [H. Faure, 1982] are an improvement upon 
ideas of Sobol’ [1967] and have been further developed; see Niederreiter 
[1987] and H. Faure [1990]. For more recent developments see Borel et 
al. [1992]. 

E n Cc | and C .6. The circumstances for using low-discrepancy 
sequences in simulation, for finite dimensions using the rejection methot: 
are outlined in Bouleau [1986]. 

Psi ese ii deterministic low-discrepancy sequences depends on the 
ae s shown and explained in this chapter, but the same applies 
andom (Borel) sequences, notwithstanding th re ic theorem 

Chung-Kiefer, as shown by G. Papà ar ME soe 
n by G. Pages (private communication). 


Section B.5. 
stochastic approximation see 


jon C6 18 based on Bouleau [9] = 
ection D1. This is standard mater: 


section D2. Expansion as ort 
nethod for approximation in L?, see 


te grad 
a = Patterson [1974]. 


sections D.3 and D.4. This section is 
she fact that the methods using polynomi 
asitivity, unlike Haar bases, which are u 
‘s shown in Neveu [1972, Chapter II, 83]. 

ection D.4 is based on Bouleau et al. [1989] 


CHAPTER 3 


Accessible accounts of the theory of stochastic processes from the point 
of view of applications can be found in Dacunha-Castelle and Duflo [(1) 
(2)], Soize [1986], and Bouleau [1988]. 

Sections A.l to A.6. These sections largely follow Bouleau [(5) (6)]. 
For other results, which are sometimes easier to use, and bounds for speed 
in the shift method, see Ben Alaya [1992]. 

Section A.7. For more detailed investigations of p-uniformly dis- 
tributed sequences, see Rauzy [1976] and Bass [1984]. The proof of the 
proposition concerning Wey! polynomials in A.7.2 is taken from Rauzy 
[1976]. 

Section A.8. The theory of Dirichlet forms and their probabilistic 
interpretation comes from Fukushima [1980] for the finite-dimensional 
case and Bouleau and Hirsch [1991] for the general case. 

Sections B.1 to B.10. Stationary processes are dealt with in classical 
texts such as Skorohod [1965], Breiman [1968], Shiryayev [1984], Rosen- 
blatt [1985], Bouleau [1988], Preumont [1990]; for discrete-time processes 
see Azencott et al. [1984]. For accounts that are oriented more toward 
applications see Lin [1969], Preumont [1990], and Bouc [1991]. a an 
extension to processes that may not be stationary but admit a spectral rep- 


resentation see Rao [1986]. . m 
Among the Rei E on the simulation of stationary ee n 
11979), Shinozuka and Jan [1974]: Yang and Lin [1 


agner [1986]. 
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e of the more recent works on 
Se See processes is Dellacherie, Maison 
ntains the standard references on w ; 

processes. See also Meyer [1967], Getoor [1975], Williams [19791 
Fukushima [1980], Dacunha-Castelle and Duflo [1983], Shiryayey [19847 
Bliedtner and Hansen [1986], and Bouleau [1988]. The links betwee, 
Brownian motion and potential theory (harmonic and subharmonic fie. 
tions) are elucidated in Port and Stone [1978] and Durrett [19841 from 
which our treatment is partly drawn. | 

Sections D.1 to D.8. Processes with independent increments are dealt 
with in classical texts such as Feller [1966]. Here the accent is placed 
on the links with convolution semi-groups, for which see Berg and Forst 
[1975]. For the probabilistic interpretation of Lévy measure, see Bretag. 
nolle [1973]. | | 

The mathematical theory of stable processes 1s as general as that of 
Gaussian processes; see Marcus and Pisier [1983]. The formulas attained 
in Lepage et al. [1981] allow approximate simulations to stable processes. 
For the simulation of stable processes see also Boulanger [1990]. These 
sections are apparently the first account of the use of subordination in the 
sense of Bochner [1949] in simulation. 

Section D.8 follows Bouleau and Chateau [1989] and Chateau [1990], 

Sections E.l to E4. Our presentation of point processes follows 
Neveu [1977]; for an application to traffic models see Bouleau [1988]. 
For applications in geostatistics and the estimation of laws see Matheron 


[1970]. 


Sections 
mathematical theory of Mark 
and Meyer [1992], which co 


CHAPTER 4 


References on the theory of Markov processes have been given above in 
the notes for Chapter 3, Section C, and references for the various sections 
of Chapter 4 are mentioned in the text. For the links between problems 
of réduites and the mathematics of finance, see Lamberton and Lapeyre 
[1991]. We give here a sketch of the historical roots of the theory. 

The development of the theory of Markov processes is marked by the 
increasing generality and refinement of results. The main stages include: 


e Kolmogorov’s discovery [Kolmogorov, 1931] of precise hypotheses 
under which it is possible to establish explicit links between arkov 
processes on R” and elliptic and parabolic partial differential equa- 
tions. Bachelier [(1) (2)] had expressed these links in ‘an intuitive 
fashion. 


APTE ° 


| The interpretation of Newtonian S 
sores of Brownian motion, begu 
| Hunts construction of a potenti 


: al 
Markov process (using Choquet ea starting from a general 


me ’S definiti i 
the measurability of entrance times). In nae Of capacity to show 
Way, as Meyer, Bau 
, Bauer 


and Hunt have shown, potential theories are : 
Markov processes. © Interpreted in terms of 


re is a great number of results or 
nie framework of standard ee Poe hee the Blumenthal and 
and Getoor formalization of the very general M a Meyer-Walsh 
cesses, and the study of diffusions, that is. of M. fae ae 
ic differential equati SOS 0 

a p quations, notably by Stroock-Varadhan and Ikeda- 

For a more detailed history, and references to the works mentioned 
above, we refer to Bouleau [1987] and the notes of Dellacherie and Meye 
[1983, 1987], as well as Dellacherie, Maisonneuve, and Meyer [1992]. 

The theory of Markov processes has recently been revived by: (1) the 
fne study allowed by the discovery that stationariness is given by an 
` appropriate choice of entry and exit laws, see Dellacherie, Maisonneuve, 
and Meyer [1992]; (2) the developments in infinite dimensions, notably for 
Ornstein-Uhlenbeck processes in Wiener space, using Malliavin calculus, 
see Bouleau-Hirsch [1991]. 

Our treatment in Section A.3 of the resolvent family indexed by func- 
tions comes from Neveu [1972]; see also Revuz [1975]. 

For applications, the book by Kree and Soize [1983] applies the tech- 
niques of markovianization of stationary processes to the calculus of struc- 
tures under random actions. 


CHAPTER 5 


al; for more detailed accounts | 


Sections A.] to A.4. Section A is classic 
: Stroock and Varadhan [1979], 


see Revuz and Yor [1991], Protter [1990], 
Ikeda and Watanabe [1981]. 

Sections B.1 to BA. In Section B we 
results about the discretization of an EDS. T 
Ject of much work; see Talay [1990], O. Faure 
Te Kloeden and Platen [1992]. For an instructive 

88]. 


have given only the simpler 
his problem has been the sub- 
[1990], Bouleau and Talay 
account see Gard 


Nualart and Zakai [1990], 


nase The general char- 


Sec ) ~f] C.1 f 
ctions C.] to C.4. Section Is can be found. 


Where deeper results on multiple integra 
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es from the fact that the more complicateg : 

ed continuously in terms of it. The joint ies p 

on of a Brownian particle at time 1 was deriy 
by Paul Lévy [1951, 1965]. An integration by parts yields the joint ise 

(w(t), w(t), ft œ; (s) da@n(s), f (s) do (s)). See Gaines and Lyons [1997 
for a simulation method for these joint laws. 

For an EDS that does not satisfy the commutativity condition, Clark and 
Cameron [1980] showed that taking values of the Brownian motion at time 
intervals h only gives an accuracy of O( h). Faster methods are possipe 
using domain integrals (see Pardoux and Talay [1985] and Kloeden ang 
Platen [1992]) which can be simulated approximately or exactly. 

These irregularity properties of Brownian functionals provide motiva. 
tion for attempts to obtain laws for random variables from stochastic ca]. 
culus; see, for example, Yor [1992]. 

The property of approximate continuity at regular points of solutions 
of an EDS in the sense of Stratonovich was discovered by Stroock and 
Varadhan [1972]. See Ikeda and Watanabe [1981]. For improvements of 
this result and connected work see Carmona and Nualart [1991], Hargé 
[1993], Sugita [1992], and Fang [1991]. For a more detailed account than 
that of Section C.4, see Bouleau and Talay [1992]. 

Sections D.1 to D.5. We thank G. Caplain and W. Kendall for useful 
discussions on the definition of simulatable functionals. 

Sections E.1 to E.4. For the use in information science of the formal 
aspects of Ito calculus see Kendall [1991]. For a weakening of the hy- 
potheses of proposition E.1.2 see Meyer [1976]. Our account of the for- 
mulas of Hu—Meyer follows Hu and Meyer [(1) (2)]. 

Formal calculus for stochastic differential equations of Stratonovitch 
type follows the same rules as for ordinary differential equations. The 
equations can be solved formally, and the validity of the results can 
be established under certain assumptions; see Fliess and Norman-Cyrot 
[1982] and Hu [1990]. | 

The explanation of the Isobe-Sato formula is based on Léandre and 
Meyer [1989]. Section E.4 draws from the accounts of Azencott [1982] 
and Ben Arous [1989], which contain conditions for validity of the expan- 
sions. | 

Sections F.I to F4. These sections are based on Bouleau [1991 
are the result of discussions with J. P. Conze, E. Lesigne, and M. Ben 
Alaya. Representations of Brownian motion such as (F.4.3) are of great 
use for a number of detailed studies of Wiener space. They allow us (0 
see Wiener space as a sequence space; see Ciesielski [1960], Baldi et ?” 
[1992] and Kerkyacharian et al. [1991]. 
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